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Abstract 

This is a selfcontained set of lecture notes on instantons in (super) Yang-Mills theory 
in four dimensions and in quantum mechanics. First the basics are derived from scratch: 
the regular and singular one-instanton solutions for Yang-Mills theories with gauge groups 
SU(2) and SU(N), their bosonic and fermionic zero modes, the path integral instanton 
measure, and supersymmetric Yang-Mills theories in Euclidean space. Then we discuss 
applications: the #-angle of QCD, the solution of the U(l) problem, the way Higgs fields 
solve the large-instanton problem, and tunneling and phase transitions in quantum me- 
chanics and in nonabelian gauge theories. These lecture notes are an extension of a review 
on Yang-Mills and D-instantons written in 2000 by both authors and A.Belitsky [1]. 
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1 Introduction 



In the last decades enormous progress has been made in understanding nonperturba- 
tive effects, both in supersymmetric field theories and in superstring theories. By non- 
perturbative effects we mean effects due to solitons and instantons, whose masses and 
actions, respectively, are inversely proportional to the square of the coupling constant [2]. 
Typical examples of solitons are the kink, the vortex, and the magnetic monopole in field 
theory, and some D-branes in supergravity or superstring theories. In supersymmetric field 
theories these solutions preserve half of the supersymmetry and saturate BPS bounds. As 
for instantons, we have the Yang-Mills (YM) instantons in four dimensions [3, 4, 5], or 
tunnelling phenomena in quantum mechanics with a double-well potential as described by 
the kink, see e.g. [6], and there are various kinds of instantons in string theory, for example 
the D-instantons [7] . Also instantons preserve half the number of supersymmetries in su- 
persymmetric field theories. Instantons can also be defined in field theories in dimensions 
higher than four [8], but we discuss in this chapter mainly the case of four dimensions. 

Instantons in ordinary (i.e., nongravitational) quantum field theories are by definition 
solutions of the classical field equations in Euclidean space with finite action. 1 Only for 
a finite classical action S c \ is the factor exp[— ^S c \] in the path integral nonvanishing. 
We shall consider instantons in nonabelian gauge theories in flat spacetime (there are no 
instantons in abelian gauge theories in flat space), both regular instantons (which actually 
have a singularity at \x\ 2 = oo) and singular instantons (which have a singularity at a 
point x = x but not at \x\ 2 = oo). A singular gauge transformation maps the first into 
the second, and vice- versa 2 . Around a given instanton solution, there are the quantum 
fluctuations. The action contains terms with 2, 3, 4 ... quantum fields, and one can 
perform perturbation theory around the instanton. The terms quadratic in quantum fields 

1 In gravity there are various definitions of instantons: Einstein spaces with selfdual Weyl tensors, 
selfdual Riemann tensors, solutions of the Einstein equations with/without finite action etc. Since in gravity 
spacetime is part of the solution, one usually considers spacetime topologies which are different from that 
of R 4 . A selfdual Riemann tensor leads to an Einstein space (R^ = Ag M „) whose Einstcin-Hilbcrt action 
is either infinite (if the cosmological constant A is nonvanishing), or it only gets contributions from the 
Gibbons-Hawking boundary term [9]. In general, the semiclassical approximation of the Einstein-Hilbert 
action is not well defined due to the unboundedness of the action inside the path integral. To cure this, 
one probably has to discuss gravitational instantons inside a full theory for quantum gravity. For instanton 
solutions in flat space but using curvilinear coordinates (for example S 4 , or cylindrical coordinates) see 
[10]. 

2 For the "regular solution" , is finite on R 4 U oo = S 4 everywhere, but this docs not mean that it is 
regular. It is finite only because one can use two different patches to cover S 4 , and A ^ is regular in each 
patch. If one maps infinity to the origin by a space-inversion transformation (x^ = y^/y 2 ), then one finds 
a singularity at the origin. In this sense the "regular solution" is singular. We further clarify this issue in 
the next section. 
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yield the propagators, which are complicated background-dependent expressions, and the 
terms cubic and higher in quantum fields yield the vertices. However, there is a subtlety 
with an instanton background: there are zero modes. A zero mode is by definition a 
solution of the linearized field equations for the fluctuations which is normalizable. (It is 
an eigenfunction of the quantum field operator with eigenvalue zero). In a trivial vacuum 
there are no zero modes: there are, of course, solutions of the linearized field equations, but 
they are not normalizable. We must treat the zero modes in instanton physics separately 
from the nonzero modes; for example, they have their own measure in the path integral. 
The nonzero modes live in the space orthogonal to the zero modes and in this space one 
can invert the linearized field equations for the fluctuations and construct propagators, and 
do perturbation theory. 

Instantons describe tunnelling processes in Minkowski space-time from one vacuum at 
time ti to another vacuum at time £2- The simplest model which exhibits this phenomenon 
is a quantum mechanical point particle with a double-well potential having two vacua, 
or a periodic potential with infinitely many vacua. Classically there is no trajectory for 
a particle to travel from one vacuum to the other, but quantum mechanically tunnelling 
occurs. The tunnelling amplitude can be computed in the WKB approximation, and is 
typically exponentially suppressed. In the Euclidean picture, after performing a Wick 
rotation, the potential is turned upside down, and it is possible for a particle to propagate 
between the two vacua, as described by the classical solution to the Euclidean equations of 
motion. The claim is then that the contributions from instantons in Euclidean space yield 
a good approximation of the path integral in Minkowski space. We shall prove this for the 
case of quantum mechanics. 

Also in YM theories, instantons are known to describe tunnelling processes between 
different vacua, labeled by an integer winding number, and lead to the introduction of 
the CP-violating #-term in the action [11, 12]. It was hoped that instantons could shed 
some light on the mechanism of quark confinement. Although this was successfully shown 
in three-dimensional gauge theories (based on the Georgi-Glashow model) [13], the role 
of instantons in relation to confinement in four dimensions is less clear. Together with 
the non-perturbative chiral U(l) anomaly in an instanton background, which leads to 
baryon number violation and a solution of the U(l) problem [4, 5], instantons are used 
in phenomenological applications to QCD and the Standard Model. To avoid confusion, 
note that the triangle chiral anomalies in perturbative field theories in Minkowski space- 
time are canceled by choosing suitable multiplets of fermions. There remain, however, 
chiral anomalies at the non-perturbative level. It is hard to compute the non-perturbative 
terms in the effective action which lead to a breakdown of the chiral symmetry by using 
methods in Minkowski space-time. However, by using instantons in Euclidean space, one 
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can relatively easily determine these terms. The nonperturbative chiral anomalies are due 
to fermionic zero modes which appear in the path integral measure (in addition to bosonic 
zero modes). One must saturate the Grassmann integrals over these zero modes, and this 
leads to correlation functions of composite operators with fermion fields which do violate 
the chiral U(l) symmetry. The new non-perturbative terms are first computed in Euclidean 
space, but then continued to Minkowski space where they give rise to new physical effects 
[5] . They have the following generic form in the effective action (we suppress here possible 
flavor or adjoint indices that the fermions can carry) 



where 2n is the number of fermionic zero modes (n depends on the representation of the 
fermions and the gauge group). The prefactor is due to the classical instanton action and 
is clearly non-perturbative. The terms indicated by 0(g 2 ) are due to standard radiative 
corrections computed by using Feynman graphs in an instanton background. The term 
(AA) n involving antichiral spinors A is produced if one saturates the integration in the 
path integral over the fermionic collective coordinates and violates in general the chiral 
symmetry. On top of (1.1) we have to add the contributions from anti-instantons, generat- 
ing (AA) n terms in the effective action, where A denotes chiral spinors. As we shall discuss, 
for Majorana spinors in Euclidean space the chiral and anti-chiral spinors are independent, 
but in Minkowski space-time they are related by complex conjugation, and one needs the 
sum of instanton and anti-instanton contributions to obtain a hermitean effective action. 

We shall also apply the results of the general formalism to supersymmetric gauge the- 
ories, especially to the M = 4 SU(N) supersymmetric Yang-Mills (SYM) theory. Here 
M stands for the number of supersymmetries. Instantons in M = 1,2 models have been 
extensively studied in the past, see e.g. [14] for an early reference, and still are a topic of 
current research. For the M = 1 models, one is mainly interested in the calculation of 
the superpotential and the gluino condensate [15, 16]. In some specific models, instantons 
also provide a mechanism for supersymmetry breaking [16], see [17] for a review on these 
issues. In the case of M = 2, there are exact results for the prepotential [18] based only 
on general symmetry principles and electric-magnetic duality; the prepotential acquires 
contributions from all multi-instanton sectors. These predictions were successfully tested 
against direct field theoretical calculations in the one-instanton sector in [19], and for a 
two- instanton background in [20]. More recently, new techniques were developed to per- 
form multi-instantons calculations in [21]. Finally, the nonperturbative structure of M = 4 
SYM has been studied thoroughly in the context of the AdS/CFT correspondence [22]. 
SYM instantons in the limit of large number of colors were succesfully shown to reproduce 
the D-instanton contributions to certain correlation functions, both for single instantons 




(1.1) 
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[23, 24] and for multi-instantons [25]. Other correlation functions were studied in [26, 27]. 
For a recent review of instantons in supersymmetric gauge theories, see [28]. 

The material is organized as follows. In section 2, we discuss the winding number of 
gauge fields, and we present the standard one-instanton solution in SU(2) and in SU(N). 
This already raises the question how to embed SU(2) into SU(N), and we discuss the 
various embeddings. In section 3 we discuss instanton solutions in general: we solve the 
duality condition and find multi-instanton solutions which depend on their position and 
their scale. We concentrate on the one-instanton solutions, and first determine the singular 
solutions, but then we make a (singular) gauge transformation and obtain the regular 
solutions. In section 4 we start the study of "collective coordinates", the parameters on 
which the most general instanton solutions depend. We show that the number of collective 
coordinates is given by an index theorem for the Dirac operator in an instanton background. 
We then give a derivation of this index theorem, and conclude that a /c-instanton solution 
in SU (N) has 4Nk bosonic collective coordinates, 2Nk fermionic collective coordinates for 
fermions in the adjoint representation, and k fermionic collective coordinates for fermions 
in the defining (fundamental, vector) representation. In section 5 we explicitly construct 
the zero modes for gauge group SU (N) in a one-instanton background. First we construct 
the bosonic zero modes; these are associated to the collective coordinates for translations, 
dilatations and gauge orientations. Next we derive the explicit formula for the general 
solution of the fermionic zero modes of the Dirac equation in a one-instanton background, 
first for SU(2) and then for SU(N). 

In section 6 we construct the one-instanton measure for the bosonic and fermionic collec- 
tive coordinates. We explain in detail the normalization of the zero modes since it is crucial 
for the construction of the measure. We convert the integration over the coefficients of the 
bosonic zero modes to an integration over the corresponding bosonic collective coordinates 
by the Faddeev-Popov trick, but for fermionic zero modes we do not need this procedure 
because in this case the coefficients of fermionic zero modes are already the fermionic col- 
lective coordinates. In section 7 we discuss the one-loop determinants in the background of 
an instanton, arising from integrating out the quantum fluctuations. We then apply this to 
supersymmetric theories, and we use an index theorem to prove that the determinants for 
all supersymmetric YM theories cancel each other. Furthermore, we compute the complete 
nonperturbative (3 function for supersymmetric Yang-Mills theories by assuming that the 
measure for the zero modes does not depend on the renormalization scale /i. However, 
since it is not known to which regularization scheme this procedure corresponds, this re- 
sult cannot be checked by standard perturbative calculations. In section 8 we discuss the 
J\f = 4 supersymmetric Yang-Mills theory in Euclidean space and its instantons. 

The remaining sections contain applications. Section 9 discusses the problem of large 
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instantons and its solution in terms of Higgs fields and spontaneous symmetry breaking. 
Section 10 gives a detailed discussion how instantons can describe tunnelling. In section 
11 we use a quantum mechanical model with a double- well potential to discuss the phase 
transition from a false vacuum to the true vacuum by bubble formation. Section 12 contains 
the strong CP problem, the mystery that the 9 angle is so small. Section 13 discusses that 
instantons solve the U (1) problem and in section 14 we finally discuss how instantons lead 
to baryon decay. 

In a few appendices we set up our conventions and give a detailed derivation of some 
technical results in order to make the material self-contained. In appendix A we provide 
details of the calculation of the winding number. In appendix B we discuss the 't Hooft 
tensors and the spinor formalism in Euclidean space. In appendix C we calculate the 
volume of the moduli space of gauge orientations. Finally, in appendix D we show that 
conformal boosts and Lorentz rotations do not lead to additional zero modes. 

2 Winding number and embeddings 

We start with some elementary facts about instantons in SU (N) Yang-Mills theories. The 
action, continued to Euclidean space, is 

S = ~^J d^tri^, ; F, v = F« u T a . (2.1) 

The generators T a are traceless anti-hermitean N by N matrices satisfying [T a , T b ] = f a b c T c 
with real structure constants and tv(T a T b ) = —\5 a b- For instance, for 577(2) one has 
T a = — |r , where r a are the Pauli matrices and f a bc = £ a bc- Notice that with these 
conventions the action is positive. Further conventions are D^Y = d^Y + [A^, Y] for any 
Lie algebra valued field Y, and F^ u = d^A u — d u A fl + [A^, A v ], so that F^ u = [D^, D v \. The 
Euclidean metric is 5^ = diag (+,+,+, +). In (2.1), the only appearance of the coupling 
constant is in front of the action. The group metric g ab = 5 ab is an invariant tensor 3 , so 
we may raise and lower indices with 5 ab and 5 ab . Thus we may also write [T a , T b ] = f abc T c , 
and from now on we shall write group and Lorentz indices either as covariant indices or as 
contravariant indices, depending on which way is most convenient. 

By definition, a Yang-Mills instanton is a solution of the classical Euclidean equations 
of motion with finite action. The classical equations of motion read 

DpF^ = . (2.2) 

3 From tr[T c ,T a Tb] = tr([T c , T a ]Tt, + trT a [T Cl Tb]) it follows that g ab is an invariant tensor: transforming its indices 
by an adjoint transformation with parameter A c yields zero: 5g ab = \ c f ca d 9db + ^ c fcb d 9ac = 0. 
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To find solutions with finite action, we require that the field strength tends to zero at 
infinity faster than \x\~ 2 = r~ 2 , hence the gauge fields asymptotically approach a pure 
gauge 4 

A, W = °° U-%U , (2.3) 

for some U G SU(N). To prove that gauge fields are pure gauge if the curvature F^ v 
vanishes, is easy. Using Ud^U^ 1 = —d^UU^ 1 we must solve for U from d^U = —A^U, 
whose solution is the path-ordered integral U = exp[— J x A^(y)dy^]. This expression does 
not depend on the path chosen because F^ u = 0. (Note, however, that if two gauge field 
configurations, say A 1 ^ and Aj/ , yield the same curvature, F flu (A I ) = F^ u (A n ), they need 
not be gauge equivalent. A simple example proves this. Consider 

A^{-\B y T^\BxT^}- Al 1 = {A 1 , 1 = ^BTuAi 1 = VBT 2 ,0,0] , (2.4) 

where B is a constant and T a are the generators of SU (2) with structure constants f ab c = 
e abc . Clearly F 12 (A r ) = BT 3 and also F 12 (A H ) = BT 3 while all other components of F^ v 
vanish. To prove that A 1 ^ cannot be written as U^ 1 (d^ + A^U we note that if there was 
such a group element U, it should satisfy UF^U' 1 = F M „, hence U should commute with 
T 3 . This implies that U would be given by exp(/(x)T 3 ) for some real function f(x). Then 
— ^ByTz = d x fTz + e~^ T3 V / 5T 1 e~^ T3 which has no solution. One can also calculate a 
Wilson loop W = trPexp § Adl. This expression is gauge invariant, and if one chooses as 
loop a square in the x — y plane with sides L\ and L 2 , one finds 

W 1 = BL ± L 2 T 3 ] W n = 2v A B(LiT 1 + L 2 T 2 ) (2.5) 

If A 1 ^ and A 1 ^ were gauge equivalent, W 1 should have been equal to W 11 ). 

There is actually a way of classifying fields which satisfy the boundary condition in 
(2.3). It is known from homotopy theory that all gauge fields with vanishing field strength 
at infinity can be classified into sectors characterized by an integer number called the 
Pontryagin class, or the winding number, or instanton number, or the topological charge 

k = -T^~2 I d 4 ztriV^„ , (2.6) 



16tt 2 _ 

where *F flu = ^e^puFp^ is the dual field strength, and e 1234 = 1. Note that it is not 
necessary that these gauge fields satisfy the field equations, only that their field strength 



4 Another way of satisfying the finite action requirement is to first formulate the theory on a compactified 
R 4 , by adding and identifying points at infinity. Then the topology is that of the four-sphere, since 
R 4 U 00 ~ S A . The stereographic map from R 4 U 00 to S* 4 preserves the angles, and is therefore conformal. 
Also the YM action is conformally invariant, implying that the action and the field equations on R 4 U 00 
are the same as on S 4 (the metric on the sphere is — 6^(1 + x 2 )~ 2 ). The finiteness requirement is 
satisfied when the gauge potentials can be smoothly extended from R 4 to S* 4 . The action is then finite 
because S 4 is compact and is well-defined on the whole of the four-sphere. 
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vanishes sufficiently fast at r = oo. The derivation of this result can be found in Appendix 
A. As part of the proof, one shows that the integrand in (2.6) is the divergence of a current 

= ~g^2 e vp°~^ T A u ippA a + ^A p A a ^ . (2-7) 

The four-dimensional integral in (2.6) then reduces to an integral over a three-sphere at 
spatial infinity, and one can use (2.3) to show that the integer k counts how many times this 
spatial three-sphere covers the gauge group three-sphere S 3 SU(2) C SU(N). In more 
mathematical terms, the integer k corresponds to the third homotopy group n 3 (SU(2)) = 
Z. So k as defined in (2.6) does not depend on the values of the fields in the interior, but 
only on the fields at large \x\ 2 . This can also directly be seen: under a small variation 
A p — > A p + 5A P one has F^ u — > F^ v + Dp5A u — D U 5A P , and partial integration (allowed 
when 5A p is only nonzero in a region in the interior) yields bAyD^F^y which vanishes 
due to the Bianchi identity D^F up ^ = 0. (To prove this Bianchi identity one may use 
F vp = [D„,D P ]. In [Dp, [D v ,D p ]] + [£>„, [D p ,D^]] + [D p , [D^D U ]} there are then 12 terms 
which cancel pairwise.) 

Since we require instantons to have finite action, they satisfy the above boundary con- 
ditions at infinity, and hence they are classified by k, which we call the instanton number. 
Gauge potentials leading to field strengths with different instanton number can not be 
related by continuous gauge transformations. This follows from the fact that the instanton 
number is a gauge invariant quantity. In a given topological sector, the field configuration 
which minimizes the action is a solution of the field equations. (It is a priori not obvious 
that there exist field configurations that minimize the action, but we shall construct such 
solutions, thereby explicitly proving that they exist). We now show that, in a given topo- 
logical sector, the solution to the field equations that minimizes the action has either a 
selfdual or anti-selfdual field strength 

p — — p (o a\ 

This equation is understood in Euclidean space, where (*) 2 = 1. In Minkowski space there 
are no real solutions to the selfduality equations since (*) 2 = —1. As seen from (2.6), 
instantons (with selfdual field strength) have k > whereas anti-instantons 
(with anti-selfdual field strength) have k < 0. (Recall that trT a T b is negative). To 
see that minimum action solutions are indeed selfdual or anti-selfdual, we perform a trick 
similar to the one used for deriving the BPS bound for solitons: we write the action as the 
square of a sum plus a total derivative term 

S - ~J d 4 xtrF 2 = -^ J d 4 x tr (F =f *F) 2 =f ^-^ J d*xtr F*F 

~ T h\ d ^ F * F=8 f^- ( 2 - 9 ) 
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We used that tr*F*F = trFF and omitted Lorentz indices to simplify the notation. The 
equality is satisfied if and only if the field strength is (anti-) selfdual. The value of the 
action is then S c \ = (Sn 2 /g 2 )\k\, and has the same value for the instanton as for the 
anti-instanton. However, we can also add a theta-angle term to the action, which reads 

S e = -i-^ J d 4 xtr F^*F» U = i9k = ±i6\k\ . (2.10) 

The plus or minus sign corresponds to the instanton and anti-instanton respectively, so 
the theta-angle distinguishes between them. In Minkowski spacetime this term is the same 
because both d 4 x and F^ V *F^ V produce a factor % under a Wick rotation. We give a more 
detailed treatment of the theta-angle term and its applications in Section 12. 

It is interesting to note that the energy-momentum tensor for a selfdual (or anti-selfdual) 
field strength always vanishes 5 

Tfxv = -^tr {F w F vp - \5 pv F pa F pa ) = . (2.11) 

(Because in Euclidean space T 44 = — ^tr (E 2 — B 2 ), the Euclidean "energy" T 44 need not be 
positive definite). This agrees with the observation that the instanton action J d 4 xtrF 2 = 
J d A x tr *FF is metric independent in curved space. The vanishing of the energy-momentum 
tensor is consistent with the fact that instantons are topological in nature. It implies that 
instantons do not curve Euclidean space, as follows from the Einstein equations. 

An explicit construction of finite action solutions of the Euclidean classical equations of 
motion was given by Belavin et al. [3]. We shall derive this solution, and others, in section 
3, but to get oriented we present it here, and discuss some of its properties. The gauge 
configuration for one instanton (k = 1) in SU{2) contains the matrices o pv or a pv . One 
often writes it in terms of the 't Hooft 77 tensors, related to a pv by a pv = irf pu r a where 
r a are the generators of SU(2). We discuss these tensors in Appendix B. The regular 
one-instanton solution reads then 

A a (r- r n n) - o V a ^( x ~ x oY 

A » ~ A » \2i) ~ (x - x ) 2 + p 2 ' {2A2) 

where x and p are arbitrary parameters called collective coordinates. They correspond 
to the position and the size of the instanton. The above expression solves the selfduality 



5 Note that T 12 is proportional to t^Fisi^+i^i^), which is equal to minus itself due to the selfduality 
relations F 12 — -^34,^13 = — -F24 an d fi4 = ^23- Similarly Tn vanishes because it is proportional to the 
trace of (** + F? 3 + F^) - (F| 3 + F| 4 + F| 4 ). 
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equations for any value of the collective coordinates. Notice that it is regular at x = xq, 
as long as p ^ 0. The real antisymmetric eta-symbols are defined as follows 



V\u = eV //, v = 1, 2, 3 , r^% 4 = -?7% = ^ , 

»7V = e V W = !> 2 ' 3 > *7V = = • ( 2 - 13 ) 

The 77 and ^-tensors are selfdual and anti-selfdual respectively, for fixed index a. They 
form a basis for antisymmetric four by four matrices, and we have listed their properties 
in Appendix B. They are linear combinations of the Euclidean Lorentz generators L^ u , 
namely rf ^ = (J a + K a )^ and fj%„ = (J a - K a )^, where J a = e abc L bc and K a = L a4 , 
and (L mn )^ u = 5 mfl 5 nu — 5 mv 5 njJi with m,n — 1,4. In this subsection we use rj tensors, but 
in later sections we shall use the matrices a^ u and a^. 

The field strength corresponding to this gauge potential is (use (B.5)) 

P 2 

^ uv m^TT v> i <m ' (2-14) 

^ [{X — XqY + p 2 \ 2 

and it is selfdual. Thus (2.12) is a solution of the classical field equations. Far away, 
A ^ becomes proportional to the inverse radius \ so that it contributes a finite amount to 
the integral for the winding number which is of the form J A 3 (r 3 dQ), while F^ u becomes 
proportional to ^, yielding a finite action. However, A a ^ itself vanishes at r — > 00, hence 
we have a smooth configuration on S 4 . Notice that the special point p — 0, corresponding 
to zero size instantons, leads to zero field strength and corresponds to pure gauge. Strictly 
speaking, this point must therefore be excluded from the instanton moduli space of col- 
lective coordinates. Finally one can compute the value of the action by integrating the 
density 

tri^F"" = -96 - pA . (2.15) 

l(x - x y + p 2 j 4 

Using the integral given at the end of Appendix B, one finds that this solution corresponds 
to k = 1. 

One may show by direct calculation that the regular one-anti-instanton solution is also 
given by (2.12) but with fj a ^ u . (In the proof one uses that the first formula in (B.5) also 
holds for ffnv). 

We shall also derive the one-instanton solution in the singular gauge. In terms of 77 
symbols it reads 

This gauge potential is singular for x = Xq, where it approaches a pure gauge configuration 
as we shall show in the next section, A^ Ud^U~ l . The gauge transformation U is 
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singular and relates the regular gauge instanton (2.12) to the singular one (2.16) at all 
points. The field strength in singular gauge is then (taking the instanton at the origin, 
x = 0, otherwise replace x — > x — x ) 



Notice that despite the presence of the anti-selfdual eta-tensors fj, this field strength is 
still selfdual, as can be seen by using the properties of the eta-tensors given in (B.5). The 
singular gauge is frequently used, because, as we will see later, zero modes fall off more 
rapidly at large x in the singular gauge. One can compute the winding number again in 
singular gauge. Then one finds that there is no contribution coming from infinity. Instead, 
all the winding is coming from the singularity at the origin. The singular solution is 
singular at Xq, so one would expect that the regular solution is singular at infinity. This 
may seem puzzling since we saw that the regular solution was smooth on S A . However, to 
decide whether a configuration is smooth at r — > oo, one should first transform the point at 
infinity to the origin and then study how the transformed configuration behaves near the 
origin. Making the coordinate transformation x M = y^/y 2 or x^ = —y^/y 2 , not forgetting 
that a vector field transforms as A' (y) = (dx u /dy fl )A l/ (x), one finds that the transformed 
regular k — 1 solution is indeed singular at the origin 6 . In fact, it is equal to the singular 
k — — 1 solution with p replaced by K 

At first sight it seems that there are five collective coordinates for the k — 1 solution. 
There are however extra collective coordinates corresponding to the gauge orientation. One 
can act with an SU(2) matrix on the solution (2.12) to obtain another solution, 



with constant 6. One might think that these configurations should not be considered as 
a new solution since they are gauge equivalent to the expression given above. This is 
not true, however, the reason being that, after we fix the gauge, we still have left a rigid 
SU(2) symmetry which acts as in (2.18). So in total there are eight collective coordinates, 
also called moduli. In principle, one could also act with the (space-time) rotation matrices 
SO (A) on the instanton solution, and construct new solutions. However, these rotations can 
be undone by suitably chosen gauge transformations [29]. Actually, the Yang-Mills action 
is not only invariant under the Poincare algebra (and the gauge algebra), but it is also 

6 This coordinate transformation in R 4 can be viewed as a product of two conformal projections, one 
from the plane to the coordinate patch on the sphere S 4 containing the south pole, and the other from 
the other coordinate patch on S* 4 with the north pole back to the plane. The transformed metric is 
d'tiuiu) = S^v/v \ so conformally flat. Then the action for the A'^(y) in ^/-coordinates is again the usual 
flat space action in (2.1), and the transformed instanton solution is an anti-instanton solution. 




(2.17) 



A,(x;x ,p,6) = U- 1 (6)A,(x;x ,p)U(6) , U e SU(2) . 



(2.18) 
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invariant under the conformal algebra which contains the Poincare algebra and further the 
generators for dilatations (D) and conformal boosts (K^). As shown in Appendix D, for the 
Euclidean conformal group 5*0(5, 1), the subgroup 5*0(5) consisting of 50(4) rotations and 
a combination of conformal boosts and translations (77 M = K fi +p 2 P fi ), leaves the instanton 
invariant up to gauge transformations. This leads to a 5 parameter instanton moduli space 
50(5, l)/50(5), which is the Euclidean version of the five-dimensional anti-de Sitter space 
AdS 5 . The coordinates on this manifold correspond to the four positions and the size p of 
the instanton. On top of that, there are still three gauge orientation collective coordinates, 
yielding a total of eight moduli for the k — 1 instanton in SU(2). 

Instantons in SU(N) can be obtained by embedding 577(2) instantons into SU(N). For 
instance, a particular embedding is given by the following 77 by 77 matrix 



where the instanton resides in the 2x2 matrix on the lower right. Of course this is not 
the most general solution, as one can choose different embeddings, see below. 

One can act with a general SU(N) element on the solution (2.19) and obtain a new 
one. Not all elements of SU(N) generate a new solution. There is a stability group that 
leaves (2.19) invariant, acting only on the zeros, or commuting trivially with the SU(2) 
embedding. Such group elements should be divided out, so we consider, for 77 > 2, 



One can now count the number of collective coordinates. From counting the dimension of 
the coset space in (2.20), one finds there are 47V — 5 parameters. Together with the position 
and the scale of the SU (2) solution, we find in total 47V collective coordinates for a one- 
instanton solution in SU(N). It is instructive to work out the example of 577(3). Here we 
use the eight Gell-Mann matrices {A a },a = l,...,8. The first three A a , a = 1, 2, 3, form an 
577(2) algebra and are used to define the k = 1 instanton by contracting (2.12) or (2.16) 
with A a . The generators A4, . . . , A7 form two doublets under this SU(2), so they act on 
the instanton and can be used to generate new solutions. This yields four more collective 
coordinates. Then there is A 8 , corresponding to the U(l) factor in (2.20). It commutes 
with the SU(2) subgroup spanned by A , and so it belongs to the stability group leaving 
the instanton invariant. So for SU(3) and k — 1, there are seven gauge orientation zero 
modes, which agrees with 47V — 5 for 77 = 3. 

The embedding of instanton solutions as a 2 x 2 block inside the 77 x 77 matrix rep- 
resentation of SU(N) is not the only embedding possible. For example, one can also use 




(2.19) 




U e 




(2.20) 
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the 3x3 matrix representation T a of 577(2), and put the instanton inside a 3 x 3 block of 
the N of SU(N). This 3 of SU(2) is sometimes called "the other SU(2) in S77(3)", but it 
is simply the adjoint representation of SU(2), which is also the defining representation of 
50(3), and is given by (T a ) ij = e iaj , 
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(2.21) 



This representation has the same structure constants f a bc = tabc as the representation 
T a = r a /(2i), but now tr{T a T b } = —25 a b, four times larger. 

In fact, going back to the construction of the instanton, we note that any representation 
T a of SU(2) yields an instanton solution for SU(N) as long as it fits inside the N x N 
matrices of SU(N) [30] 

K = 2 ^^f? • (2 ' 22) 

The 2 of SU{2) with T a — || yields (2.19), but any other representation yields another 
embedding. 

For SU (3) there are only two possibilities. We can embed the instanton using the 2 of 
S77(2); this yields (2.19). But we can also use the matrices T a given in (2.21) as the first 3 
generators of SU(3). For SU(N) we can use any spin j representation of SU(2) provided 
it fits inside the N x N matrices. Since the action and winding number are proportional 
to the trace tr T a T b , which is proportional to the quadratic Casimir operator j(j + 1) times 
the dimension 2j + 1 of the spin j representation 7 , we see that we get instanton solutions 
with winding number k = ±|j(j + 1) (2j + 1). For j = 1/2 this reduces to k — ±1. For 
the first few SU(N) the results are as follows 

517(3) : k = ±1; k = ±4 (j = 1/2 and j = 1) 
577(4) : k — ±1; k — ±4; k — ±10 (j = 1/2, 1, 3/2) 

k = ±2 (two j = 1/2 in block form) 
SU(5) : k — ±1, ±4, ±10, ±20 (j = |,1, |, 2) 

k = ±2, ±5 (j = \ | and j = \ 1) . (2.23) 

All these instanton solutions with winding number \k\ > 1 still are (anti-) selfdual, so they 
still have minimal action, determined by the winding number, so the same as k instantons 
embedded as 2 x 2 matrices but far apart. Two instantons far apart and each of the form 

7 Use 5 ab trT a Tb = —trC2(R) = — (2j + l)C2(i?) where the quadratic Casimir operator for the represen- 
tation R with spin j is given by C 2 (R) = -S ab T(R) a T(R) b = j(j + 1). 
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(2.12) repel each other (as opposed to an instanton and anti-instanton) with an interaction 
energy proportional to 1/r. Bringing k instantons together such that they sit all at the same 
point, gives solutions of the kind above. So far apart there is a small positive interaction, 
but when they are brought together the interaction energy vanishes. Hence, there must be 
domains of attraction in between. This already shows that the interaction of instantons is 
a complicated problem [30]. In fact, one can deform these single-instanton solutions such 
that a multi-instanton solution is obtained in which the single-instantons do not attract 
or repel each other. In other words, in such a multi-instanton solution the positions, sizes 
and gauge orientations of the single instantons are collective coordinates. 

For the general multi-instanton solution, the dependence on all collective coordinates 
is in implicit form given by the ADHM construction [31]. For a recent review, see [32]. 
In the next section we will obtain explicit formulas for the dependence on 5k collective 
coordinates. Explicit formulas for the dependence on all collective coordinates only exist 
for the k = 2 instanton solution [31, 33, 34, 32] and the k = 3 instanton solution [35]. 

We end this section with some remarks on embeddings into other gauge groups [36]. For 
k = 1 and gauge group SO(N), it is known that there are 4N—8 collective coordinates. This 
can be understood as follows. The one-instanton solution is constructed by choosing an 
embedding of 5*0(4) = SU (2) x SU (2) generated by jf and f\ a , and putting the instanton 
in one of the SU(2) groups. The stability group of this instanton is SO(N — 4) x SU(2), 
so we obtain (for N > 4) 

A » ~ \0 Af (2) ) ' G SO(N-4) x SU{2) ' ^ 

The number of collective coordinates of such solutions follows from the dimension of the 
coset (which is 4N — 13). Including the positions and size of the SU(2) instanton, we arrive 
at 4iV — 8 for the total number of collective coordinates. Notice that for N = 6, we can use 
the isomorphism between SO(6) and SU(4). For both countings, we arrive at 16 moduli. 

Similarly, we can analyze the symplectic gauge groups USp(2N). Here we can simply 
choose the lower diagonal SU(2) = USp(2) embedding inside USp(2N) for a k — 1 in- 
stanton. The stability group of this embedding is now USp{2N — 2), so for we have the 
following instanton solution: 

A » ~ U { Af (2) ) ' USp(2N - 2) ■ {2Jb) 

The dimension of U Sp(2N) is iV(2iV-|-l), 8 and so the total number of collective coordinates 
that follows from this construction is 5 + (4iV — 1) = 4(iV + 1), which is the correct number 



3 Thc dimension of U (2N) is AN 2 and the generators have the form ( 1 . 1 I where aj is 




15 



[36]. For N = 2, we have the isomorphism USp(2) = SO(5), which in both countings leads 
to 12 collective coordinates. 

For higher instanton number, not all instantons can be constructed from a properly 
chosen embedding. There the ADHM formalism must be used. We just mention here 
that the total number of collective coordinates is AkN,Ak(N — 2) and Ak(N + 1) for the 
gauge groups SU(N),SO(N) and USp(2N) respectively. The geometric relation between 
instanton moduli spaces and quaternionic manifolds (whose dimension is always a multiple 
of four) can e.g. be found in [37]. 

2.1 Some remarks on nonselfdual instanton solutions 

Note that we have not shown that all solutions of (2.2) with finite action are given by 
selfdual (or anti-selfdual) field strengths. In principle there could be configurations which 
are extrema of the action, but are neither selfdual nor anti-selfdual 9 . For the gauge group 
SU (2) this has been a long standing question. The first result was established in [38, 39, 40] 
where it was shown that for gauge groups SU{2) and 577(3), nonselfdual solutions cannot 
be local minima, hence if they exist, they should correspond to saddle points. The existence 
of nonselfdual solutions with finite action and gauge group SU (2) was first established in 
[41], for k — 0, and later for k ^ in [42]. For gauge group SU(3) some results have 
been obtained in [43, 44]. The situation seems to be quite complicated, and no elegant and 
simple framework to address these issues has been found so far. For bigger gauge groups, 
it is easier to construct non-selfdual (or anti-selfdual) solutions. This becomes clear in 
the example of SO (A) = SU(2) x SU(2). If we associate a selfdual instanton to the first 
factor, and an anti-selfdual instanton to the second factor, the total field strength satisfies 
the equations of motion (2.2) but is neither selfdual nor anti-selfdual. Even simpler is the 
example of S77(4). By choosing two commuting 577(2) subgroups, we can embed both an 

antisymmetric and s, is symmetric. Complex symplectic matrices M = I ^ ^ ] satisfy M T fl + QM = 



N 2 + N(N - 1) constraints (D + A T = and C - C T = B - B T = 0). 

9 It is possible to construct solutions for SU{2) that are not selfdual, but not with finite action. An 
example is = —^cr^u^i- Its field strength is F^ v = ^cr^/r 2 + ^(x^a w — x v 'a f _ Lp )x p / V 4 . One can check 
that it satisfies the second order equation of motion (2.2) (both d^F^ and [A^^F^] vanish), but this 
configuration is not selfdual since F^ v — *F^ V = jcr^^p-. Because this field strength does not tend to zero 
fast enough at infinity, the action evaluated on this solution diverges logarithmically. 



C D 




The restriction that the unitary generators be also symplectric leads to 
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SU{2) instanton and an anti-instanton inside SU(A), 



Af W = [ f ° ) , (2.26) 



A: 



where A^ denotes the (anti-) selfdual SU(2) gauge potentials with topological charges k ± . 
Clearly the total field strength is neither selfdual nor anti-selfdual, but satisfies the second 
order equations of motion. The instanton action is finite and the total topological charge 
is k + — k~. 

From the embedding (2.26) one can generate more solutions by acting on the gauge 
potential with a global gauge transformation U G SU(A). In this way, one generates new 
exact and nonselfdual solutions which are not of the form (2.26). 

For SU(N) gauge groups, one has even more possibilities. One can embed k + instantons 
and k_ anti-instantons on the (block)-diagonal of SU(N), as long as 2(k + + k_) < N . If 
we take both k + > and > 0, the solution is clearly not selfdual or anti-selfdual and 
the instanton action, including the theta-angle, is given by 

S=^-(k+ + k-)+ie(k+-k-) . (2.27) 

In a supersymmetric theory, these solutions will not preserve any supersymmetry. This is 
interesting in the context of the AdS / CFT correspondence that relates M = 4 SYM theory 
to type IIB superstrings. In [45] , it is shown that these non-selfdual Yang-Mills instantons 
are related to non-extremal (non BPS) D-instantons in IIB supergravity. 



3 Regular and singular instanton solutions 

To find explicit instanton solutions, we solve the selfduality (or anti-selfdualty) equations 
Fpv = where *F (U/ = |e^ pCT F pCT with /i, v = 1,4 and e 1234 = e 1234 = 1. Since *F /XU 
vanishes identically due to the Bianchi identity, we then have a solution of the field equa- 
tions, D^F^ U = 0. The main idea is to make a suitable ansatz, and then to check that it 
yields solutions. The ansatz is (we restrict ourselves for the moment to the gauge group 
SU(2)) 

A^x) = a a^dy In 0(x 2 ) , (3.1) 

where a is a real constant to be fixed and a^ u is the 2x2 matrix representation of the 
Lorentz generators in Euclidean space. Since we shall be using these matrices a^ v a lot, 
we first discuss their properties in some detail, and then we shall come back below (3.22) 
to the construction of instanton solutions. 
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3.1 Lorentz and spinor algebra 

In Euclidean space, a suitable 4x4 matrix representation of the Dirac matrices is given 
by 

{ i(<7")«7J J ' a» = (r, -iJ) ' 1 • ) 

where f are the Pauli matrices. We use slashes instead of dots on the spinor indices to 
indicate that we are in Euclidean space. All four Dirac matrices are hermitian, and satisfy 
{t^jT"} — 25^. The matrix 7 s is diagonal 

7 5 - 7V7V = ( -/ ) ' (3 ' 3) 

and chiral spinors correspond to projections with |(1 ± 75) which yield the upper or lower 
two components of a nonchiral four-component spinor. 

Since we are in Euclidean space, it does not matter whether we write the index /i as a 
contravariant or covariant index. In Minkowski space this representation (with 7 4 replaced 
by 7 where 7 4 = 27°, so that ( / y k ) 2 = +1 but (7 ) 2 = — 1) is used for two-component spinor 
formalism. Four-component spinors are then decomposed into two-component spinors as 
ib = (_"), and this explains the position of the spinor indices on and in (3.2). 
The Euclidean Lorentz generators (5*0(4) generators) acting on 4-component spinors are 
= 1 (7^71/ — 7^7^) and satisfy the Euclidean Lorentz algebra 

[M^, M pa \ = 5 up M pa - b va M w - 5 w M va + 5 pa M up . (3.4) 

However, this representation is reducible: the upper and lower components of ib form 
separate representations 

(3.5) 

In terms of <x M and cr M we then find the following two inequivalent spinor representations 
of SO (A) : Mp V = ±<j^ and M pu = \o pv , where 

cr^ = \{o»o v - a u a^) ■ a» v = \{a^a v - a u a") . (3.6) 

(It is customary not to include the factor \ in M pv = \<Jp V into the definition of a^ u ). 

The matrices a pv and a pu satisfy some properties which we shall need repeatedly. First 
of all, they are anti-selfdual and selfdual, respectively 
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This follows most easily by noting that the matrices 7 M satisfy 7^7^ = — \tpvpalplalz 
where 7^ = 7^7^ = |(7 M 7^ - lulp)- For example, 7172 = -737475 because ei 23 4 = +1- 
From this (anti) -self duality one derives another useful property 

tpvpa&UT ^pr^vp &VT&pp ^pr^pv ■ (3-8) 

It is easiest to prove (3.8) by substituting (3.7) into the left-hand side, and decomposing 
the product of two e-tensors into a sum of products of Kronecker tensors. Another proof 
is based on the "Schouten identity" which is the observation that a totally antisymmetric 
tensor with 5 indices vanishes in 4 dimensions (because there are always at least two indices 
equal). Writing the left-hand side of (3.8) as e Ml//0Q 5 J g T <7 ay g and using the Schouten identity 

tpvpafi/3 tfivpafop ~\~ ^p/3pa^u ~\~ ^pvfia^p ~\~ tpvpfi&a , (3-9) 

the identity (3.7) can be used to prove the property (3.8) (the last term in (3.9) yields 
minus the contribution of the term on the left-hand side). In a similar way one may prove 

tpVpa&CFT fopT&Vp ~\~ ^VT®pp 3pT&pU- (3.10) 

The extra overall minus sign is due to the extra minus sign in the selfduality relation in 
(3.7). 

Further identities are the commutator of two Lorentz generators, and the anticommu- 
tator which is proportional to the unit matrix in spinor space 

CFpa] = l&vpOpa + three more terms , 
Wpu, o- pa } = 2(5 lia 5 l/p - 5 W 5 VIJ ) + 2e puprT . (3.11) 

One easy way to prove or check these identities is to use 4x4 Dirac matrices; for example 
{7172,7374} = 275 and {712,713} = but {712,712} = -2 and [712,713] = -2723- Because 

75 = ^ ^ j ^ , it is clear that the a pv satisfy the same commutation and anticommuta- 

tion relations, but with a different sign for the e symbol. In particular, 

\Ppv-, &pa} = 2{5pa5vp - Sppbva) ~ ^pupa ■ (3-12) 

All these identities can also be derived using two-component spinor formalism for vectors. 
For example, a vector t> M is written as v aa ' = v^a^ 010 ' , and then one may use such identites 

as 

5p u ~ Saos^ ~ 5/5^'] 5 pv ~ e at 3e al p, . (3.13) 

If one never introduces any vector indices at all but only uses spinor indices, this spinor 
formalism turns about all identities into trivialities, but we prefer to also keep vector indices 
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around. The other extreme is to expand a^ v and a^ u into Pauli matrices r a as a^ u = if]^ u T a 
and = ir)° u T a where rf^ v and fj® u are constructed from e a ij and 5 a i tensors, as in (2.13). 
A whole calculus of these '"t Hooft-tensors" can be set-up, and is often used. We discuss 
it in appendix B. We shall not limit ourselves to one of these extremes; proofs are given 
either by using 2-component spinors or 4 x 4 Dirac matrices, depending on which approach 
is simplest for a given problem. 

The index structure of the ansatz for in (3.1) merits a short discussion. A Lie-algebra 
valued gauge field has indices i, j for a representation R of an SU (N) group. For SU(2) 
the generators in the defining representation are the Pauli matrices r a divided by 2i, hence 
Afj, = (ApYj = A® (§-) V The ansatz for the instanton can then be written as 

= (<V)'j x v f{x 2 ) . (3.14) 

The indices fi,v are Lorentz indices, but the indices i,j are SU(2) indices. Hence the 
matrix (cr^Yj carries simultaneously spacetime indices and internal SU(2) indices. The 
matrices a^ v are indeed proportional to r Q , a^ v = ifj° u T a , as one may check for specific 
values of ji and u, using 

(a,Y P ' = {r,i} ,(^) w = {f,-0. (3.15) 

The matrices rf^ and fj® u are actually invariant tensors of a particular SU (2) group. There 
are three groups SU(2): the gauge group SU(2) g and the rotation group 5*0(4) = SU (2) L x 
SU (2) R generated by rf^ and ff^ v . The tensor rf^ v is invariant under the combined SU (2) g 
gauge transformations acting on the index a generated by e abc , and the SU(2) L Lorentz 
transformations generated by . Indeed, under infinitesimal variations with parameter 
A a we find, using (B.5), 

Sr] a = e ab Cr] b X c + \x c L [rf, v a ] = if Xg = X a L . (3.16) 

Furthermore, rf^ v is separately invariant under the SU{2)r subgroup of the Lorentz group 
generated by fj b ; this follows from [r] a ,fj b ] = 0. In fact, rf = L a 4 + \e abc L bc and ff = 
-L a 4 + \t abc L bc from which [rj a : fj b ] = easily follows. 10 

Spinor indices are raised and lowered by e-tensors following the northwest-southeast 
convention: v a ' = € a 'P'vp> and v a = e^vp. So {a^Y' 01 = e 13 ' 5 ' e ay (a^)s'y There are various 
definitions of these e tensors in the literature; we define 

e af3 = -e a ' p '. (3.17) 

10 The 4x4 matrices L a 4 and L bc have entries (L a a)ixv = ^"S^ and {Lbcj^v = bb^cu — S c ^Sb u - They 
form the defining representation of the Euclidean Lorentz algebra. 
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Note that numerically e a/3 = e a/ 3 but also e a '^' = e^p* because one needs two e tensors to 
raise or lower both indices of an e tensor. We fix the overall sign by e af3 = e tj where e 12 = 1. 
A crucial relation in the spinor formalism which we shall frequently use is 

&H,a'i = CTn,ia' 5 (3.18) 

where we recall that a^ ia i = a^ 3 ' €ji€p/ a > . 

Using 2-component spinor indices for vectors, 

(a^) a , a A p = A a , a and (a 1/ )p> j x l/ = xpj , (3.19) 

the ansatz for the instanton solution in (3.1) with spinor indices for A p becomes 

(o'^)a'a(A^) j = A a i a j = ((X^) a ' a(°" /j,u) j% f ( x ) 

= {€S>P'J - ecpeajj?} f(x 2 ) = {5lx alj + e aj x\,} f(x 2 ) . (3.20) 

The trace over (ij) clearly vanishes, and this fixes the relative sign. We worked out the 
matrix {^) a , a (cr p U ) 1 ,j using 

-<>*< = 2<£>« , (3.21) 

and 

& a'a( & »)l3'j = & a'a( a ^)jP' = K'a G f e kj e ~i' P' = 2e a> /3'taj ■ (3.22) 

3.2 Solving the selfduality equations 

Let us now come back to the construction of instanton solutions. Substituting the ansatz 
for A p in (3.1) into the definition of F pv yields with (3.11) 

= aa vp d^d p In - aa w d v d p In + 

= [ptOypd^dp In - /j, <-> v) + 2a 2 (crp a d u In d a In - /j, <-> v) 

- 2aV^(91n0) 2 . (3.23) 

We want to solve the equation F pv = *F IU> . The dual of F pv can be written as an expression 
without any e tensor by using the identities for ep Vpa <j aT and ep V p a a p(J in (3.7) and (3.8). 
One finds 

Fpy 2 tfivprjFptj OLdp V p a (j aa d pd a In 

+ 2a 2 6p Up(T (Tp (3 d (T \n(f) dphi<f> - a 2 e p U pa cr pcT (d In cf)) 2 

= o V p[ptd p dp In — 2c? dp In d p In 0) — // <-> v 

+ <v(a<9 2 ln0) . (3.24) 
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Equating F^ u to *F fll/ yields two equations for 0, namely one for the terms with o vp and 
the other for the terms with a 



ad^dp In i 



2a 2 <9 M ln0 <9 p ln< 



ad^dp In < 



2a 2 d„ In 6 d„ In < 



-2a 2 (d In 0) 2 = a<9 2 ln< 



(3.25) 



The first equation is identically satisfied (for that reason we equated F^ u to +*F M „), while 
the second equation can be rewritten as <9 2 ln0 + 2a(<91n0) 2 = 0. For a — \ it simplifies 
to d 2 4>/4> = 0. (Setting cc = | is not a restriction because rescaling — > 1 / 2 " achieves the 
same result). 

Setting = ^2 yields for x ^ a solution: <9 2 0/0 = x 2 dp(—2x^ fx 4 ) = 0. However, this 
is also a solution at a; = because <9 2 :r~ 2 is proportional to a delta function (note that the 
dimensions match) and x 2 5 A (x) = 

d 2 \ = -Ak 2 5\x) . (3.26) 
x 2 

To check the coefficient, we integrate over a small ball, which includes the point x = 0; we 
obtain then j d 2 ^d i x = j r^drdVL^d^ = j r 3 dfi M (— 2x fl /x 4 ) = —4ir 2 . (The surface of a 
sphere in 4 dimensions is 27T 2 ). 

We have thus found a selfdual solution 

-,2 



Ap(x) = §o>ci In 



1 + 



(3.27) 



(x — a) 2 

We have added unity to in order that Ap(x) vanishes for large \x\. A more general 
solution is given by A^x) = \a ^ u d u In with 



^ (x - a,) 2 



(3.28) 



which also solves <9 2 0/0 = 0. These are a class of /c-instanton solutions, parameterized by 
5k collective coordinates. In particular for k — 1 we find the one-instanton solution 

r 2 

1 + 



A 



snig , 



X) 



= -<J„ 



\op U d v In 



[x — a) 



p 2 (x 



a 



(k — 1, singular) . 



(3.29) 



^ (:r-a) 2 ((:r-a) 2 + p 2 ) 

This solution is clearly singular at x — a, but one can remove the singularity at x = a by 
a singular gauge transformation (which maps the singularity to x 2 = oo). To determine 
this gauge transformation we first study the structure of the singularity. Near x = a the 
singular solution becomes 



A s * ng (x) —Ofn, 



x — a) 
[x — a) 



(3.30) 
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which is a pure gauge field with U(x — a) in (A. 9) 

U dJJ = -o^j^ — ; U{x) = y= — = ta^Xf./Vx 2 . (3.31) 



(x — a) 2 ' 

Note that U is unitary, and U' 1 equals —ia p x p /\fx 2 ) which follows from the property 
a p a p + a p a p = 25 Pfl . 

From (3.29) and (3.30) it follows that we can write A s * ng as 

4f<*> = (x -a ? + p ^ U ■ < 3 ' 32 > 
It is now clear that an opposite gauge tranformation removes the singularity at x = 

A^ix) = U(d, + Af*)U-i = d.UU'i (-1 + {x _ ^ 2 + ^ 

= (Ud.U- 1 ) -. {x \f t - . (3.33) 



The expressions [/ 1 9 M ?7 and Ud p U 1 are closely related; in fact, one finds by direct 
evaluation 

u9 ^ - • < 3 - 34 > 

Thus the regular one-instanton solution is given by 

A 7 = -**» 7 {X \fl 2 (* = 1, regular) . (3.35) 
^ (x — a) 2 + 

Of course, the singular and the regular solution are both self dual, because self-duality is 
a gauge-invariant property, but the field strengths differ by a gauge transformation. Setting 
a = for simplicity, one finds for the field strengths in the regular and singular gauge 

.2 



F™J> = 2<v [a , 2 + p2]2 ( & = 1 , regular) , 

— = ^-i^eg ^ = ^ (fc = ^ singular) . 



,2 (3.36) 

. , ( h = 1 «incrnlnri 

Vx~ 2 (x 2 + P 2 ) 2 

It is clear that F™ g is selfdual because a pv is selfdual, but also F^ lg is selfdual 11 as is clear 
from acting with e^ vpa on a^. 



llT Jsing sonic further identities which follow from the results for [7 M „,7 P ] and {7 Ai! y,7 P } 

GpGvp ^pu^p &p,p@v ^pu po® a j O p,i/Q p — &vp@p $p,p&v tpvpa^a i 

GpvGp = $vp@p &pp@v ~t~ ^pvpa^a \ G pO pv = &pp@v &pv&p ^ppva^a > 

one finds for the k = 1 singular solution 

2p 2 



> iv ~ ( X 2 + n 2 ) 2 ^ X 2 <V + ^2 *W + ° V ) 



(x 2 + p 2 ) 2 \ X 
In this form the self duality is no longer manifest. 
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The action for the one-instanton solution is, of course, proportional to the winding 
number 

S =-^j tri^i^d 4 * = ~J trF^d 4 * = ^ . (3.37) 

The same result is obtained by direct evaluation of this integral. 

The anti-instanton (the solution with k — — 1) is closely related to the instanton solution. 
Recall that we derived the instanton solution by making the ansatz = aa^dy In 0, 
evaluating and *F M „ in terms of matrices, and then setting F^ v = *F ilv . For the 
anti-instanton solution we make the ansatz = /3a^ u d u ln0. The expression for F^ u is 
unchanged (except that A^ contains a^ u instead of a^ u ), but the a^ u are selfdual instead 
of anti-selfdual, hence the expression for e^ upa a ar has opposite signs from e^ Uf „a aT . The 
equation with d^dplacf) again cancels if F^ u = — *F flUl which leads to opposite winding 
number (A; = —1). The other equation is again <9 2 ln0 + 2/3(<91n0) 2 = 0, hence (3 — \ and 
again 0=1 + J2iLi ix-a-y ^ ms Y ie ^ f° r the singular-gauge anti-instanton solution 

A 7' = -^ (x _^~_% + pl] ■ <* = "I. (3^8) 
Setting again temporarily a = 0, we find near x = 

Af 1 s w -a^/rr 2 = Ud^U' 1 , (3.39) 
with the same U = ia^x^j \fx? as before. Similarly as for the instanton, we have 

* x 2 + p 2 

AT = U-\d» + Af*)U 



d.u-'u ( -^— 2 - 1 

\ar + p 2 



x 2 



Using the expression for U^d^U in (3.31) one finds 

The curvatures for the anti-instanton solution are obtained by interchanging a pu and 
in the instanton solution 

= 2^ [(x _ Q ) 2 2 + p2]2 (* = -1, regular) . (3.42) 

So, the only difference between the instanton and anti-instanton solutions is the exchange 
between and in F^ u and For the instanton solution, F™ g and A™ g depend 
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on cr MI/ , but A^ ing depends on a^, and F*™ g also depends on (setting a = again for 
notational simplicity), 



= U = ^ (* = -!, singular) . (3.43) 



If one evaluates the product of the a matrices as in footnote 11, one finds an expression 
for F s ™ g in which the anti-self duality is no longer manifest. 



4 Collective coordinates, the index theorem and fermionic zero 
modes 

We found in section 2 one-instanton solutions (k = 1) in SU(N) with 4N parameters. 
The question arises whether these are all the solutions. To find this out, one can consider 
small deformations of the solution, + 5A^, and study when they preserve self duality. 
Expanding to first order in the deformation, and using that the variation of a curvature is 
the covariant derivative of the variation of the gauge field, this leads to the condition 

Dp5A v - D V SA^ = \D^5A V - DJAJ , (4. 1) 

where the covariant derivative depends only on the classical solution but not on 5A^. In 
addition we require that the new solution is not related to the old one by a gauge trans- 
formation. This can be achieved by requiring that the small deformations are orthogonal 
to any small gauge transformation D^A, for any function A, i.e. 

J d 4 xtr{( J D M A)M M } = . (4.2) 

This certainly rules out deformations of the form 5A^ = D^A. After partial integration 
the orthogonality requirement leads to the usual gauge condition in the background field 
formalism 

D^SA" = . (4.3) 

At this point the reader may start feeling uneasy because the conditions (4.1) and (4.2) 
may seem too strong. First of all, the deformation should be a solution but need not be 
(anti-) selfdual. Furthermore, the field equation for the fluctuations consists of the sum 
of a classical piece and a piece from the gauge fixing term, so that, requiring each part to 
vanish separately may seem too restrictive. However, one can prove the following general 
result [46]. Arbitrary solutions of the fluctuations around an (anti-) instanton which are 
square-integrable so that they do not change the winding number, are themselves also 
(anti-) selfdual and transversal. To prove this property, note that the field equations 
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for the fluctuations read D P F^ U (A + 5 A) + D V (D^5A P ) = 0. The second term comes 
from the gauge-fixing term. Taking the D v derivative, the first term vanishes while the 
second term yields D 2 (D fl 5A p ) = 0, hence D^5A p on-shell. The terms in the classical 
action which are quadractic in the fluctuations can be written as —\{f^ v — * f^v) 2 where 
fn V = D P 5A U — D U SA P . The minimum of the action yields a solution, hence f pu = * f pv 
on-shell. Thus imposing (4.1) and (4.3) is not too restrictive. 

The requirement that 5A P be square integrable is due to the fact that the inner product 
of zero modes SA p will later give us the metric or moduli space, which in turn will give 
us the integration measure of the moduli space. Also, for the index theorem which will be 
used to determine the number of zero modes, one needs the L 2 norm for fluctuations. It is 
remarkable that the zero modes which satisfy the differential equations in (4.1) and (4.3) 
are all square integrable. 

In references [46, 36] the solutions of (4.1) subject to the condition (4.3) were studied 
using the Atiyah-Singer index theorem. Index theory turns out to be a useful tool when 
counting the number of solutions to a certain linear differential equation of the form DT = 
0, where D is some differential operator and T is a tensor. We will elaborate on this in 
the next subsection and also when studying fermionic collective coordinates. The ultimate 
result of [36] is that there are 4Nk solutions, leading indeed to 4N collective coordinates 
for k — 1 [46]. An assumption required to apply index theorems is that the space has to 
be compact. One must therefore compactify Euclidean space to a four-sphere S* 4 , as was 
already discussed in footnote 4. 

4.1 Bosonic collective coordinates and the Dirac operator 

In this section we will make more precise statements about the number of solutions to the 
selfduality equations by relating it to the index of the Dirac operator. The problem is 
to determine the number of solutions to the (anti-) selfduality equations with topological 
charge k. For definiteness we consider anti-instantons, so we look for deformations which 
satisfy an anti-selfduality equation. 

As explained in the last subsection, we study deformations of a given classical solution 
A^ + 5A p . Let us define (j) p = 5A p and f pu = D p (f) u — D u (p p . The covariant derivative here 
contains only A^. The constraints can then be written as 

o^D^ = ; Drfp = , (4.4) 

which are 3 + 1 relations. Indeed, more explicitly, {o lJLU ) a a ' are 3 Lie-algebra valued 
expressions because a, a' — 1, 2 and tia^ v = 0. To prove the first relation, multiply by a pcr 
and take the trace. Since the trace of [a pa , a pv \ vanishes, while {a pa , a^} = 2(5 pa 5 pu — 
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tiva&pii) — Ztpa^u, one finds D a (p p — D p <p a — t^poD^v = 0, which is the anti-selfduality 
condition (3 relations). Both equations can be written as one simple equation as follows: 

a^a u D p (j) u = , (4.5) 

because a p cr u = 5 pu + a^, and the spinor structures of 5 pu and are independent. 
Introducing two-component spinor notation with 

P = Vp, a >pD p = D a ,p ; af<t> v = , (4.6) 

the deformations of an anti-instanton can be written as follows 

= D a ,^' = . (4.7) 

Note that is in the adjoint representation, so (4.7) stands for d a ipQ^' + [A^, $ /37 '] 
= 0. Using the explicit representation of the matrices a p in (B.9), we can represent the 
quaternion $ by 

with a and b complex adjoint- valued functions. Then (4.7) reduces to two spinor equations, 
one for 

A = Q ; p\ = 0, (4.9) 

and one for ia 2 X* = . Conversely, for each spinor solution A to the Dirac equa- 

tion, one may show that also ia 2 X* is a solution. (Use (<r M )* = — a 2 a^a 2 ). Indeed, if 
A yields a deformation (5Ai, 5A 2 , 5A 3 , 8A4), then ia 2 X* corresponds to the deformation 
(SA' 1: 5A' 2 , 5A' 3 , SA' 4 ) with SA[ = -6A 3 , 5A' 3 = 8A U 5A' 2 = 5A A and 5A' 4 = -5A 2 . They are 
not related by a Lorentz transformation because the coordinates x M are not transformed. 
Thus given A, we obtain two linearly independent deformations of the (anti-) instanton. 
As we already stressed, the spinors A are in the adjoint representation. We shall discuss 
other representations later. 

Given a solution A of the spinor equation, one can still construct two other solutions of 
the deformation of the anti-instanton, which differ by a factor % 



,<•> = « \ , = » . (4.10) 





The reason we do not count iX as a different solution for the spinors but treat and 
<3>( 2 ) as independent has to do with reality properties: 5A^ should be real, and and 
$( 2 ) yield different variations 5A p . Namely, a = 3 + i0 4 and b = 0i + i(f) 2 , so 

: 5A 4 = 04 , 5A 3 = 03 , 8 Ax = 0i , 5A 2 = 2 , 
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$ {2) : 8A 4 = 03 , 5A 3 = -04 , SA! = -02 , 5A 2 = fa 



(4.11) 



It may seem miraculous that we find a second solution without any hard work, but closer 
inspection reveals that no miracle is at work: under the substitutions SAi — > SA 2 , SA 2 — > 
—5Ai, 5A 3 — > 5A 4 , 5A 4 — > — 5A 3 , one of the anti-selfduality equations is exchanged with the 
gauge condition, and the other two duality equations get interchanged. Also for solitons 
this way of counting zero modes is encountered: for example for vortices one complex 
fermion zero mode corresponds to two real bosonic zero modes [47]. 

In fact, because $ < - 2 - ) = Q^ia^, one might wonder whether <3>( 3 ) = $^^(— iai) and 
$( 4 ) = $( x )(— ia 2 ) yield further solutions. One obtains 




(4.12) 



which are just the $ < - 1 ^ constructed from a 2 \* and ia 2 X*. So there are no further inde- 
pendent solutions [46]. Therefore, the number of solutions for $ is twice the number of 
solutions for a single two-component adjoint spinor. So, the problem of counting the num- 
ber of bosonic collective coordinates is now translated to the computation of the Dirac 
index, which we discuss next. 



4.2 Fermionic moduli and the index theorem 

Both motivated by the counting of bosonic collective coordinates, as discussed in the last 
subsection, and by the interest of coupling Yang-Mills theory to fermions, we study the 
Dirac equation in the background of an anti-instanton. We start with a massless four- 
component complex (Dirac) fermion ip, in an arbitrary representation (adjoint, fundamen- 
tal, etc) of an arbitrary gauge group 

lv P$ = pi> = . (4.13) 

We recall that a Dirac spinor can be decomposed into its chiral and anti-chiral components 




A^(l+ 7 5 )^, x=l(l-^)ij. (4.14) 



We use the Euclidean representation for the Clifford algebra discussed before 

7 " = (i _i r')- ^=^=(l°i)- (4i5> 

In Euclidean space the Lorentz group decomposes according to 5*0(4) = £77(2) x 577(2). 
The spinor indices a and a' correspond to the doublet representations of these two 5£7 (2) 



28 



factors. As opposed to the case of Minkowski space, A Q and x' a are n °t i n complex-conjugate 
representations. The Dirac equation then becomes 

P\ = 0, Px = 0, (4.16) 

where p and p are two-by-two matrixes, see (4.6), and A and x are independent complex 
two-component spinors. We now show that in the presence of an anti-instanton, (4.16) 
has zero modes for A, but not for x- Conversely, in the background of an instanton, p 
has zero modes, but p has not. A zero mode is by definition a solution of the linearized 
field equations for the quantum fluctuations which is normalizable. The fermionic fields 
are treated as quantum fields (there are no background fermionic fields), so normalizable 
solutions of (4.16) are zero modes. 

The argument goes as follows. Given a zero mode x f° r P, it also satisfies p px — 0- 
In other words, ker p C ker {p p} where ker denotes the kernel. Next we evaluate 

P P = a^ u D^D u = D 2 + ^<vF M „ , (4.17) 

where we have used er M er„ + over M = 25^, and <j^ v was defined in (3.6). But notice that the 
anti-instanton field strength is anti-selfdual whereas the tensor is selfdual, so the second 
term vanishes. From this it follows that x satisfies D 2 x = 0. Now we can multiply D 2 x 
with its conjugate x* an d integrate to get, after partial integration and assuming that the 
fields go to zero at infinity 12 , / d 4 x \ D^x\ 2 = 0. From this it follows that x is covariantly 
constant, D^x — 0, and so F^x = 0. Since F M „% = F^ u T a x, with T a the generators of 
the gauge group SU (2) in a representation R, we conclude that F£ v (x) T a x{ x ) must vanish 
at all points x. Since F* v is proportional to r\ aiiv (or f] afMU ), and rj afiu rj bfiu is proportional to 
5 a b, we find that T a x(x) vanishes for all a and all x. Then D^x — reduces to d^x — 0, 
and this implies that x — 0- We conclude that px has no square-integrable solutions. 
Stated differently, — D 2 is a positive definite operator and has no zero modes. Note that 
this result is independent of the representation of the fermion. 

For the A-equation, we have p p\ = 0, i.e. ker p C ker [p p}, and we obtain 

PP = D 2 + l -a^ v F llu . (4.18) 

This time the second term does not vanish in the presence of an anti-instanton, so zero 
modes cannot be ruled out. In fact, there do exist fermionic zero modes, because we 
shall construct them. Knowing that p has no zero modes, one easily concludes that 
ker|> = ker {p p) and ker^ = ker {p p) = 0. 

12 Normalizability of zero modes requires that % tends to zero faster than 1/r 2 (usually like 1/r 3 or 
sometimes 1/r ). Then the boundary term with x*^t^X indeed vanishes. 
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For massive spinors no zero modes are possible. To prove this one may repeat the same 
steps as for massless spinors, but now one finds that pX = imx and px = —imX, and 
iteration yields p px = m 2 x- The crucial observation is that m 2 is positive, while p p is 
negative definite. Hence, no zero modes exist for massive spinors. 

Now we can count the number of solutions using index theorems. The index of the Dirac 
operator is defined as 

Ind^ = dimker^^-dimker^^) . (4.19) 

This index will give us the number of zero modes, since the second term is zero and since 
any renormalizable solution of p pX = satisfies pX = as we have shown. There are 
several ways to compute its value. We begin by writing the index as follows 

f M 2 M 2 1 

Ind p = lim Tr < — — -= j^— } , (4.20) 

^ m^o X-pp + M 2 -pp + M 2 ) ' V ; 

where M is an arbitrary parameter. The trace Tr stands for a sum over group indices 
and spinor indices, and includes an integration over space-time. We shall discuss that this 
expression (before taking the limit) is independent of M. This implies that the operators 
p p and p p not only have the same spectrum but also the same density of states for 
non-zero eigenvalues 13 . That they have the same non-zero eigenvalues is clear: if ip is an 
eigenfunction of p p, then pip is an eigenfunction of p p with the same nonvanishing 
eigenvalue and pip does not vanish. Conversely, if ip is an eigenfunction of p p with 
nonzero eigenvalue, then pip does not vanish and is an eigenfunction of p p with the same 
nonvanishing eigenvalue. 

To show that (4.20) is independent of M 2 , we rewrite the index in terms of four- 
dimensional Dirac matrices, 

f M 2 1 

I(M 2 ) = Ind p = Tr | _ ^ 2 + ^ 75 ) , (4-21) 

where now p = . 

^4X4= I ' I (1-22) 




We rewrote the trace of the two terms in (4.20) over a two-dimensional spinor space as the 
trace of one term over a four-dimensional spinor space. It has been argued that indepen- 



13 One can also (as is customary in the literature) place the system in a large box to discretize the 
spectrum, and let the boundary conditions for the eigenfunctions of P p determine the boundary conditions 
for the eigenfunctions of p p, and vice- versa, such that the non-zero eigenvalues are the same. Such a 
treatment for the kink has been worked out in detail in [47]. However, in the limit of infinite volume, the 
densities of states can become different, as we shall discuss. 
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dence of M 2 follows by taking the M 2 -derivative (see [46]), 



I(M 2 ) = -Tr I - ,/\, \ ■ ( I 23) 



dM 2 v ' {(-ft 2 + M 2 ) 2 

Using that 75 anticommutes with ft and that the trace is cyclic, we find 

- Tr lpl = Tr ^l =Tr lpi t (4.24) 
A A A v ' 

where A = (— jp 2 + M 2 ) 2 . Hence Tr(ft 2 ^ 5 /A) would seem to vanish and this would prove 
that I(M 2 ) is independent of M 2 . The problem with this proof is that one can give a 
counter example: one can repeat all the steps for the supersymmetric kink, and this would 
then imply that the densities for chiral and anti-chiral fermion modes are equal. However, 
one can directly calculate these densities for the supersymmetric kink, and one then finds 
that they are different [48] 

^ = TO ' (4 ' 25) 

where m is the mass of the fluctuating fields far away from the kink. Applied to the case 
of instantons, the situation was considered in [46]. In [49, 50, 51] it was noted that the 
proof of [46] was incomplete. Cyclicity of the trace (on which the proof in [46] that Ap(k 2 ) 
vanishes is based), breaks down due to the presence of massless fluctuating fields 14 . One 
can directly compute I(M 2 ), using a more detailed index theorem [50, 51], and then finds 
that the densities of chiral and antichiral fermionic modes in an instanton background are 
equal, 

Ap(k 2 ) = for instantons . (4.26) 

Given that the density of states of the operator ft ft in (4.20) is the same as the density 
of states of the operator ft ft, there is a pairwise cancellation in (4.20) coming from the sum 
over eigenstates with non-zero eigenvalues, both for the discrete and continuous spectrum. 
So the only contribution is coming from the zero modes, for which the first term in (4.20) 
simply gives one for each zero mode, and the second term vanishes because there are no 
zero modes. The result is then clearly an integer, namely dimjker^}. Since J(M 2 ) is 
independent of M 2 , one can evaluate it in the large M 2 limit instead of the small M 2 limit. 
The calculation is then identical to the calculation of the chiral anomaly, which we now 
review. 



14 One would expect that at the regularized level the trace is cyclic but one may expect that one should 
also regularize infrared aspects of the problem. Consider for example quantum mechanics for a harmonic 
oscillator with mass term \m 2 q 2 . Define a = i/yg+ip/v / 2m and a 1 = yT^q — ip/ ^J2m. For to tending to 



zero, the vacuum is annihilated by p+ 0(m) but the vacuum becomes non-normalizablc when m vanishes. 
Still, at finite to, tr\p, q] = 0. 
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The chiral anomaly is equal to the regulated trace of the matrix 75. It can be written 

as 

r m- m 2 

Ind Tp = tr / dx < x I = = \x > , (4.27) 

^ J 1 + M 2 -pp + M 2 V ; 

where tr denotes the trace over group indices and spinor indices. Because 75 = diag(+l, +1, —1, — 1) 
one finds in (4.27) a relative minus sign between the first and the second term. We have 
chosen a quantum mechanical representation for the trace in a Hilbert space spanned by 
the eigenfunctions \x > of the position operator. The operators depend on the opera- 
tors x^ and the operators p M . When x reaches \x >, it becomes a c-number x. Similarly 
Pn\ x >= — j gfjrk >• The latter statement follows by contracting with a complete set of 
momentum eigenstates, using that < k\p^ = hk^ < k\ because p^ is hermitian 



< k I p^ I x >= Tik^ < k I x >= hkfj, 



; (2tt) 2 

h d e~ lkx h d , , , , — h d 



< k\x>=<k\—- — I x > . (4.28) 
% dx» (2tt) 2 tfe' 1 « &f v ' 

So, from now on we will replace the operators D^x^Px) by D^(x, —j-§^)- These act on 
the x in \x > and do not act on \k >. 

Let us now insert a complete set of eigenstates of pp and pp, respectively. The index 
becomes then 

Ind |> = tr j Ax<x\ n L X n L | O l | % X m L | * > - same with L » R , 

(4.29) 

where Ol is the first operator in (4.27) and Or the second. As we already discussed the 
eigenfunctions < x\til >= ip^h\x) and < x\hr >= (pi^\x) have the same nonvanishing 
eigenvalues \ n and the same densities. 

So the eigenfunctions with nonzero eigenvalues do not contribute to the index. (Note 
that it does not make sense to look for eigenfunctions of p or p because these operators 
change the helicity of the spinors). There are in general a finite number of zero modes in 
the L sector but none in the R sector. Hence 



\ n m 



l R) (x)^(xT 



M 2 S mn 
A 2 +M 2 



+ E / dW a L \^L L \xy = n (L) , (4.30) 

where cp^ (x) are the (square-integrable) zero modes, and is the number of these. A 
sum over spinor indices is taken in (4.30). 
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To actually compute the index (namely, to compute the integer n^), we use momentum 
eigenstates instead of eigenfunctions of pp and pp 

Ind p = J d 4 x J d 4 k J d 4 k' tr < x | k' >< k' \ I k >< k \ x > , 

(4.31) 

... / n -iff) \ 

where we recall that 75 = and p = 





As we have discussed, the operator D p = + [A^(x), ■] acts on the coordinates x in 
\x > but not on the k in \k >< k\, and the trace tr sums over the group indices and the 
spinor indices of 7^ in p = 7^D M and 75. Using < k\x >= e~ lkx /(2ir) 2 and pulling these 
plane waves to the left, the derivatives act on the c- numbers x in e~ lkx and are replaced 
by d^-ik^. The matrix element < k'\M 2 (-p 2 + M 2 )' 1 ^ \ k > is equal to < k! \ k > times 
the operator [M 2 /(— p 2 + M 2 )]^y 5 and < k'\k >= 5 4 (k — k'). When the plane wave e~ lkx has 
been pulled all the way to the left, the plane waves e lk ' x and e~ lkx in < x\k! >= e tk ' x / (2tt) 2 
and < k\x >= e~ tkx / (27i) 2 cancel each other, and one is left with 



The denominator can be written as 

1 



(p + M2) _ (_2iA: • D + D^Dn + | 7 „7^ ' 



(4.33) 



and we can exhibit the M 2 dependence by rescaling k^ = Mk^, yielding 




Indf = [ d 4 xM 4 [ T^rtr J 1 ^75 \ . (4.34) 

J J (27r) 4 I ( K 2 _|_ -\\ ( 2« M D M , D^Djx , 1 7m7^mA f V ' 



I M ^ I 

M M 2 2 M 2 



Expanding the denominator, only terms due to expanding two, three or four times can 
contribute in the limit M — > 00, but only the terms with at least four Dirac matrices 
can contribute to the trace due to the matrix 75. Thus we only need retain the square of 
\lnlvF^ v1 and the index becomes 

/[ d 4 K 1 
^ X J ( 27r )4( K 2 + l)3 tr {\ F ^l^\F palf „ lb ) 

f 2n 2 f°° r 3 dr 
= J d4x (2nT I pTIF KtrTaT6)(tr7 ^ 7pCT75) ^^ 

d 4 x-^- 2 (tTT a T b )(e^ pa F^F b pa ) , (4.35) 



/ 
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where we used that / df2^ = 2tc 2 and J °° = \- Note that both a trace over group 

indices and a trace over spinor indices has been taken. The result for the index is twice 
the product of the winding number in (2.6) and a group theory factor 

\ndp = 2^- 2 J d A xF a ^ *F»^j tr T a T b . (4.36) 

For a representation R of SU(N) for the fermions, we define tr T^T^ = —S a bT(R). By 
definition one has T(R) = | for the fundamental representation, and then T(R) = N for 
the adjoint representation 15 . Hence, finally, 

Ind p = \k\ for the fundamental representation, 

= 2N\k\ for the adjoint representation . (4.37) 

(For an anti-instanton, k is negative. The factor 2 corresponds to our earlier observation 
that ia 2 X* is also a zero mode if A is a zero mode.) Furthermore, as shown in the last 
subsection, an (anti-) instanton in SU (N) has twice as many bosonic collective coordinates 
as there are fermionic zero modes in the adjoint representation. This proves that there are 
4Nk bosonic collective coordinates for an instanton with winding number k and gauge 
group SU(N). 

5 Construction of zero modes 

In two later sections we will show how to set up and do (one-loop) perturbation theory 
around an (anti-) instanton. This will require the reduction of the path integral measure 
over instanton field configurations to an integral over the moduli space of collective coor- 
dinates. In order to achieve this we need to know the explicit form of the bosonic and 
fermionic zero modes. This is the content of this section. We follow closely [52]. 

15 To compute T(R) for the adjoint representation, write the carrier space for the adjoint representation 
of SU(N) as u% - j f 5 i j {u k v k ). Then, for i ^ j, T'fiu'v, = (T^Yeu 1 '^ + (T/* )/Vu,-,. For a diagonal 
generator A of the fundamental representation of SU(N) with entries (ia\, . . . ,ioin) with real otj one has 
Au l = ia l u l and Au l Vj = {ia l — ic^)u l Vj, so A(u l vj — j^SjU k Vk) — (ia l — ia^){u l Vj — jj5jii k Vk) and 
J2 a i = 0- Hence tr A 2 = — J2iLi( at ) 2 f° r the fundamental representation, but tr^4 2 = — J2i j{ a% ~ a ^) 2 
for the adjoint representation. The latter sum can also be written as 

N 

£ (a, a 3 f = (£ a() N - 2 a,) (£ a,) + (£ a?) N = 2 (£ a*) N . 

i >.?'=1 

So T(i? ad j) = 2NT{Rf). 
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5.1 Bosonic zero modes and their normalization 

In order to construct the bosonic zero modes and discuss perturbation theory, we first 
decompose the fields into a background part and quantum fields 

A^AfW+A?. (5.1) 

Here 7 j denote a set of collective coordinates, and, for gauge group SU (N), % — 1, . . . , ANk. 
Before we make the expansion of the action, we should first fix the gauge and introduce 
ghosts, c, and anti-ghosts, b. We choose the background gauge condition 

Dt A T = • ( 5 - 2 ) 

The gauge-fixing term is then £ fix = — ^tr(D M ^ u ) 2 and the ghost action is £ g h os t = 
— b a (D^(A c J)D^(A c f l + Afflc)' 1 . The action, expanded through quadratic order in the quan- 
tum fields, is of the form 



9 

with = D 2 and 



8tt2 
2 



fc|+ltr / d 4 x{A^M^A^ + 2bM^c} , (5.3) 



= (D 2 5, u - D U D^ + F^) + D^D U = M$ + M$ , 

= % + 2F F , (5.4) 

where we have dropped the subscript cl. Here, stands for the quadratic operator 

coming from the classical action, and is due to the gauge fixing term 16 . (Recall that 
acts on A^ as [F^, A^ u }). In an expansion as in (5.3), one encounters zero modes (i.e. 
normalizable eigenfunctions of the operator M M „ with zero eigenvalues). They are of the 
form 

dA cl 

Zf = + , (5.5) 

where the gauge parameter A 1 is chosen to keep in the background gauge, so that 

Dpf = . (5.6) 

The first term in (5.5) is a solution of (i.e. an eigenfunction with zero eigenvalue), 
as follows from taking the derivative with respect to 7^ of the field equation. Namely, 
SS^/SA* = for all 7i , so 

= /A^' (5 - 7) 

16 To arrive at this expression for M$ , use that = F^ v + (DfA^ u - D^A^ U ) + L4Jf, A£ u ] and note 
th^-^tv2F^[A^,Ar] = ^tvA^[F^,Ar}. 
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The term D^A is also a solution of since it is a pure gauge transformation. 17 The 

sum of the two terms is also a solution of M^ 2 \ because A is chosen such that is in 
the background gauge. As we shall show, the solutions in (5.5) are normalizable, hence 
they are zero modes. Due to these zero modes, we cannot integrate over all quantum 
fluctuations, since the corresponding determinants would vanish and yield divergences in 
the path integral. They must therefore be extracted from the quantum fluctuations, in a 
way we will describe in a more general setting in the next subsection. It will turn out to 
be important to compute the matrix of inner products 

jjij = = -^J d 4 xtr {Zfz^} = ^ J Zf a Zf a tfx . (5.8) 

We put a factor ^ in front of the usual L 2 inner product because the metric U tj will 
be used to construct a measure (det U'^) 1 ^ 2 for the zero modes, and this measure is also 
needed if one considers the quantum mechanics of zero modes ji(t). The action for these 
time-dependent %(t) is U lj/ y i 'jj with the same prefactor p- as in the Yang-Mills gauge 
action. 

We now evaluate this matrix for the anti-instanton. For the four translational zero 
modes, one can easily keep the zero mode in the background gauge by choosing A* = A^. 
Indeed, 

dA cl 

Zl u) = ^ + D X = ~d,Af + DX = K , (5-9) 
which satisfies the background gauge condition. The norms of these zero modes are 

= — = S cl 5^ . (5.10) 
T 

As indicated, this result actually holds for any k, and arbitrary gauge group. 

Next we consider the dilatational zero mode corresponding to p and limit ourselves to 
k = — 1. Taking the derivative with respect to p leaves the zero mode in the background 
gauge, so we can set A p = 0. In the singular gauge of (3.38) we have 

ZV> = -2 , P *'" X " . (5.11) 
^ (x 2 + p 2 ) 2 

(To show that (5.6) is satisfied, note that (d/dx^zjf^ = since is antisymmetric, while 
[A c *zjf^] = since both involve a^ v x u ). Using (B.18) and (B.21), one easily computes 
that 

UPP = ^ = 2S * ■ ( 5 - 12 ) 

r 



17 This is also easy to prove by direct calculation: {D 2 5 I _ LU — D v D li + F^DyK is equal to D V [D U , D p ]A + 
F pu D u k, and this vanishes since [D V ,D^ = F vp and D v F up — 0. More generally SS cl /8A^ ~ D C ^F^ V 
is gauge-covariant, hence D^F^ v {A p + D p A) - D^F^ v (A p ) = [D^F^ V ,A] which vanishes on-shell (field 
equations transform into field equations). Hence ^lf^d (D p A) — M$D v h vanishes. 



36 



In regular gauge one finds zjf^ = ^°+ p z) 2 wmcn nas clearly the same norm. This result 
can also be derived from j^A™ g (k = -1) = + A s ^(k = -l))U and the identity 

U^a^U = -a^x". 

The gauge-orientation zero modes can be obtained from (2.18). By expanding U(9) = 
exp(6 a T a ) infinitesimally in (2.18) we get to lowest order in 6 (the case of general 6 will be 
discussed shortly) 

f)A 

0^ = [^T a ], (5.13) 

which is not in the background gauge (the matrices T a are in the fundamental represen- 
tation). To satisfy (5.6) we have to add appropriate gauge transformations, which differ 
for different generators of SU(N). First, for the SU(2) subgroup corresponding to the 
instanton embedding, we add, for the singular gauge, 

I 
x 2 + p 

and find that 



^ = -^—2 T ^ ( 5 - 14 ) 



Z,,( n \ = D, 



X 2 ' 

-2 i n2 a 



X Z + p 1 



(5.15) 



^2 



(using dp x 2 +p2 = —d^ x 2 p +p2 )- One can now show, using (B.5), that the zero mode (5.15) is 
in the background gauge, and its norm reads 18 

4tt 2 

Uab = -^-P 2 $ab = \&abP 2 S c \ . (5.16) 



18 A few details may be helpful. One finds for this zero mode in the singular gauge, using (3.38) and 
(B.15), 

Zf = 2x^p 2 (x 2 + p 2 y 2 T a + 2r ]b ^e bac T c x l ,p 2 /(x 2 + p 2 ) 2 . 

It is covariantly transversal: acting on the first term plus the commutator of A S ™ E with the second term 
vanishes upon using (B.5). (The commutator of the first term with A s * ng is proportional to (a llu x l ')x^ and 
vanishes). The norm is due to integrating the sum of the square of the first term and the second term, 
using (B.21) with n = 1 and m = 4. All terms which contribute to Z^\ namely gfrr^ 1118 and c^A - 81 " 8 
and [A^ lng , A a ' smg ] fall off as 1/r 3 for large |x|, and Z^f 1 itself is nonsingular at x = 0. 
In regular gauge one finds from (3.40) 

d 7 A r ;s = d 7 u-\d^ + Af*)u = u-%a?*u , 

and the transversality condition becomes 

U^D^A^Up^d^A^U + U^D^A^UU^A^^U] 
= L> M (A ros )[a 7 ^ og + D lt (A^ K )U- 1 A a '^ Bg U] = . 

Hence, A a,rcg = U~ 1 A a ' smg U , and now all contributions to Z^ in the regular gauge fall only off as 1/r. 
Only their sum Z^ a ' ) ' res falls off as 1/r 3 , just as Z^' slng . It is clearly simpler to work in the singular gauge, 
because then all integrals separately converge. 
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We need the gauge-orientation zero modes for arbitrary values of 9 because this is needed 
for the group (Haar) measure. They are obtained as follows. By differentiating U (9) and 
using that U^-^U is equal to e a a (9)T a , where the function e a a (9) is called the group 
vielbein (with a a curved and a a flat index according to the usual terminology 19 ), one 
obtains 

d 



9p;M0) = [M0),e* a (0)Ta] (5.17) 

2 

For A( a ) we take now A( a )(0) = — x 2 p +p 2 e a a (9)T a , and then we obtain for the gauge zero 
modes at arbitrary 9 

Z» {a) {6) = D,(A(9)) (-^ e "Wa) 

D,(A(9 = 0)) y^ 2 d a UU-^ 



= u- 1 



U . (5.18) 



We define 20 dJJU' 1 = f a a (9)T a . Note that tr daUU^dpUU' 1 = tr {U~ l d a U U^dpU) = 
e a a ep b tiT a T b = f a a fp b ti T a T b . Hence the left-invariant metric e a a ep b 5 a b is equal to the 
right-invariant metric. There is a geometrical interpretation of these results [53, 54] . 
There are only two differences with the 9 = case 

(i) the factors U(9) and U~ l (9) in front and at the back; these drop out in the trace 

(ii) the factors of f a a multiplying T a . Taking the trace one obtains the group metric 

U a %9) = (Zj? | Zf) = e a a (9)ep b (9)U ab (9 = 0) = e a a (9)e^(9) (|p 2 5 cl ) . (5.19) 

Hence, in the square root of the determinant of U one finds a factor det e a a (because 
det(e a a 5 ab e l3 b ) = (dete a a ) 2 ), and this yields the Haar measure 

H(0) = det e a a (6)d 3 e . (5.20) 

Using this measure one can calculate the group volume V of SU(2), V = J(dete a a )d 3 9, 
which is independent of the choice of coordinates 9. (We chose the parametrization U (9) = 
exp9 a T a , but any other parametrization yields the same result.) 

19 The group vielbein is given by 

e a a ( ) T a = T a + i [T a , 9 ■ T] + 1 [ ( T a ,6 ■ T] , 9 ■ T] + ■ ■ ■ , 

whereas the adjoint matrix representation M a,di {9) is given by 

e- T T a e T = M aA \9) a b T b = T a + [T a , 6 - T] + ■ ■ ■ . 

One has M a,d ' i (9) a h = (cxp9 c f. c ) a b . There is arelation between the group vielbein and the adjoint matrix: 
(9^ + l)e a b (6) = (M^(9)) a b 

20 The functions e a a (9) are sometimes called the left-invariant one-forms, while f a a (9) are the right- 
invariant one-forms. 
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We have now calculated all norms. It is fairly easy to prove that there is no mixing 
between the different modes, for example = U^a = U^ p \ = 0. Thus the matrix U lj 

for SU(2) is eight by eight, with non-vanishing entries along the block-diagonal 



cl 



25, 



cl 



V 



\g af 5{0)p 2 s d ) 



(5.21) 



The square root of the determinant is 



U = \St xP ^detg a ^6) = yjdetg a p(8) (for SU(2)) . (5.22) 

Let us now consider the remaining generators of SU(N) by first analyzing the example 
of 577(3). For simplicity, we restrict ourselves again to lowest order in 8 a . There are seven 
gauge orientation zero modes, three of which are given by (5.15) by taking for T a the 
first three Gell-Mann matrices Ai, A2, A3 multiplied by — |. For the other four zero modes, 
corresponding to A4, . . . , A7, the formula (5.13) still holds, but we have to change the gauge 
transformation in order to keep the zero mode in background gauge, 



A* 



X 2 + p 2 



T k , fc = 4,5,6,7, 



(5.23) 



with Tk = (—i/2)\k- The difference in x-dependence of the gauge transformations (5.14) 
and (5.23) is due to the change in commutation relations. Namely, Yll=i[^a, [A a , A^]] = 
— (3/4)A j a for /3 — 4, 5, 6, 7, whereas it is — 2A^ for /3 — 1, 2, 3. (These are the values of the 
Casimir operator of SU(2) on doublets and triplets, respectively). As argued before, there 
is no gauge orientation zero mode associated with As, since it commutes with the SU(2) 
embedding. The zero modes are then 



D, 



x z 



x 2 + p 2 



k = 4,5,6,7 , 



with norms 21 



U, 



ki 



(5.24) 



(5.25) 



and are orthogonal to (5.15), such that Uk a = 0. This construction easily generalizes to 
SU(N). One first chooses an SU{2) embedding, and this singles out 3 generators. The 

21 These zero modes axe given by = p 2 x v /(v / x 2 (x 2 + p 2 ) 3 ^ 2 )(S^T k + 2i] a ^ ly [T a ,T k }) in the singular 

gauge, see (3.38). For the first three zero modes we found instead Z^/ a \ = p 2 x v j{x 2 + p 2 ) 2 (28 tJil ,T a + 
2»fc /i „[T 6 ,T ]) with [T b ,T a ] = e bac T c . The norm of (5.24) is proportional to tr T fe T ; + 4tr [T a , T k ][T a , T,] = 
4tr T k Ti, where we used (B.5). 
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other generators can then be split into 2(iV — 2) doublets under this SU (2) and the rest are 
singlets. There are no zero modes associated with the singlets, since they commute with 
the SU(2) chosen. For the doublets, each associated zero mode has the form as in (5.24), 
with the same norm \p 2 S c \. This counting indeed leads to AN — 5 gauge orientation zero 
modes. Straightforward calculation for the square-root of the complete determinant then 
yields an extra factor (\p 2 S c \) 2 ^ N ~ 2 \ and so 

2 2N+7 / nn \ AN 

VU = — m (for SU(N)) . (5.26) 

This result is a factor 2 4iV ~ 5 smaller than [52, 1], since we chose U{6) = exp9 a T a instead 
of exp(2# a T a ). This ends the discussion about the (bosonic) zero mode normalization. 



5.2 Construction of the fermionic zero modes 

In this subsection we will explicitly construct the fermionic zero modes (normalizable so- 
lutions of the Dirac equation) in the background of a single anti-instanton. For an SU (2) 
adjoint fermion, there are 4 zero modes according to (4.37), and these can be written as 
follows [63] 

X a = -\a pa % (V - a^'fj Y (x - x ) v ) Fp/j . (5.27) 

The SU(2) indices u and v are carried by (\ a ) u v and (F pa ) u v . 

To prove that these spinors are solution of the Dirac equation, use o^Opo = Sp P a a — 
Sucr^o — ^ixpcr^T- Then vanishes since D pFp a vanishes when contracted with f]pp, Tj^ a 
or e^po-T- Actually, this expression also solves the Dirac equation for higher order k, but 
there are then additional solutions, A\k\ in total for SU(2), see (4.37). The four fermionic 
collective coordinates are denoted by £ a and fj 7 >, where a, 7' = 1,2 are spinor indices in 
Euclidean space 22 . They are the fermionic partners of the translational and dilatational col- 
lective coordinates in the bosonic sector. These solutions take the same form in any gauge, 
one just takes the corresponding gauge for the field strength. The canonical dimension of 
£ and f\ is —1/2 and 1/2, respectively. 

For SU(N) (and always k = —1) there are a further set of 2 x (N — 2) zero modes in 
the adjoint representation, and their explicit form depends on the gauge chosen. In regular 
gauge, with color indices u, v = 1,...,N explicitly written, the gauge field is given by 
(3.41) (setting xo = 0, otherwise replace x — > x — x ) 

v. = W. = -££, = ( ^ ) " (5 ' 28) 

22 To check that the expression with fj is a solution, one may use that u p Up_ v a p — 0. Note that one may 
change the value of x in (5.27) while keeping fixed, because the difference is a solution with 
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Then the corresponding fermionic instanton in the adjoint representation reads 

\ a \ = f (^S\ + e au fl v ) . (5.29) 

Here we have introduced Grassmann collective coordinates 



//* = (/,... , li N ~\ 0,0) ;e c 



/ o,...,o, 

e al3 ' | with N - 2 + f3' = u , (5.30) 

V o,...,o, 



and similarly for fi v and 8 a v . Thus the SU(N) structure for the fermionic instanton is as 
follows 

A «(?/.). (5.31) 

The canonical dimension of p and /i is —1/2. To prove that (A Q ) u t, in (5.29) satisfies the 
Dirac equation cr^c^A + [A^, A]) = 0, note that the terms {A^) u w p w and /^(A^)"^ vanish 
due to the index structure of and fi, Jx. Because A^ has only nonzero entries in the 
lower right block, there cannot be fermionic instantons in the upper left block. 
In singular gauge, the gauge field is given by (3.38) 

n2 



Au u — 9/ 9 , r,\°'UVU X U ■ (5.32) 

x 2 (x 2 + p 2 ' 



Notice that the position of the color indices is different from that in regular gauge. This is 
due to the natural position of indices on the sigma matrices 23 . The fermionic anti-instanton 
in singular gauge reads [55] 

AV = P (fi u x av + x a u ji v ) , (5.33) 

where for fixed a, the iV-component vectors fi u and x av are given by 

/i u = (»!,..., /Mr-2,0,0) , x av = (o,...,0,x»a^ with N - 2 + (3' = v . (5.34) 

Further, x a u = x av e vu and p" also has N — 2 nonvanishing components. The particular 
choice of zeros in the last two entries corresponds to the choice of embedding the SU (2) 
instanton in the lower-right block of SU(N). Notice that the adjoint field A is indeed 
traceless in its color indices. This follows from the observation that \x and \x only appear at 



23 To be very precise, we could have used different Pauli matrices (r a ) u v for the internal SU(2) generators. 
Then we could have defined a matrix (a All/ ) tl t , by = ir\ ailu T a , and the SU(2) indices in (5.32) and (5.33) 
would have appeared in the same position as in (5.29). It is simpler to work with only one kind of Pauli 
matrices. 
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the off-diagonal blocks inside SU(N). In general // and fi are independent, but if there is a 
reality condition on A in Euclidean space, the \i and \x are related by complex conjugation. 
We will discuss this in a concrete example when we discuss instantons in Af = 4 super 
Yang-Mills theory. We should also mention that while the bosonic collective coordinates 
are related to the rigid symmetries of the theory, this is not obviously true for the fermionic 
collective coordinates, although, as we will see later, the £ and f\ collective coordinates can 
be obtained from ordinary supersymmetry and conformal supersymmetry in super Yang- 
Mills theories. 

A similar construction holds for a fermion in the fundamental representation. Now there 
is only one fermionic collective coordinate, see (4.37), which we denote by /C. The explicit 
expression for k = —1 in singular gauge is 24 

{X a ) u = P x a u K . (5.35) 

\Jx 2 (x 2 + p 2 ) 3 

In regular gauge it is given by 

The Dirac equation for (\ a ) u is proportional to 

-Wo^^ u - a^pe^ia^V (5.37) 

and to show that this vanishes one may use (a = (^^) u/3 and the symmetry of the 
Lorentz generators (cr^ u ) uf3 = (a flv )^ u and a^a^ = 3a u . 

6 The measure for zero modes 

Having determined the bosonic and fermionic zero modes for k = ±1 instantons with 
SU (N) gauge group, we now discuss the measure for the zero mode sector of path integrals. 
The one-loop corrections due to the nonzero modes, will be discussed in the next section. 

6.1 The measure for the bosonic collective coordinates 

We now construct the measure on the moduli space of bosonic collective coordinates, and 
show that the matrix U plays the role of a Jacobian. We first illustrate the idea for a 

24 The color index should again be written as (\ a ) u , because A reg '" = (U) uv ' \ s j s with U uv ' = 
er™ 1,,/Va?. However, we drop these primes. The proof that (5.35) satisfies the Dirac equation uses 
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generic system without gauge invariance, with fields (fi A , and action S[4>] (for example, the 
kink in one dimension). We expand around the instanton solution 

<j> A (x)=<f>i(x rl ) + <f>A n (x il ) . (6.1) 

The collective coordinates are denoted by 7 and, for notational simplicity, we assume there 
is only one. At this point the fields (f)^ u can still depend on the collective coordinate, as 
they can include zero modes. The action, up to terms quadratic in the quantum fields, is 

S = S d + IC M ab (0ci) C ■ (6.2) 
The operator M has zero modes given by 

Z* = 9 -0 , (6.3) 

since, as we explained in (5.7), MabZ b is just the derivative of the field equation dS c \/d(f) A x 
with respect to the collective coordinate. More generally, if the operator M is hermitian 
(or rather self-adjoint 25 ), it has a complete set of eigenfunctions F a with eigenvalues e a , 

M AB F* = e a F* . (6.4) 

One of the solutions is of course the zero mode Z = Fq with eo = 0. Any function can be 
expanded into a basis of eigenfunctions, in particular the quantum fields, 

a 

with coefficients £ a . The eigenfunctions have norms, determined by their inner product 

(F a \F,) = J d*xF*(x)F*(x). (6.6) 

The eigenfunctions can always be chosen orthogonal, such that (F a \Fp) = S a /3U a . The 
action then becomes 

S = S d + l^2 tat a e a u a . (6.7) 

a 

If there is a coupling constant in front of the action (6.2), we rescale the inner product 
with the coupling, such that (6.7) still holds. This was done in (5.8). The path-integral 
measure is now defined as 

00 / — 

a=0 v 



25 More precisely, if there is an inner product ((f>i, fa) — J 4>iHab4>2 ( ^ 4x with real fa, fa and with metric 
Ha B , and Ha B H bc — Sa c , then one may define 4> a Hab — <Pb so that {fa, fa) — J faA$2^ x - ^ onc 
further defines H bc M C d = M B D , then M A B is hermitian if (fa,Mfa) = (Mfa,fa). The need for a 
matrix to define an inner product is familiar from spinors, but for bosons the metric is in general trivial 
(H A b = Sab)- 
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We perform the Gaussian integration over the £ Q and get 



/ 



[d0] e~ s W = / J^d^e-^idet'M)- 1 / 2 . (6.9) 

./ V 27T 



One sees that if there were no zero modes, the measure in (6.8) produces the correct result 
with the determinant of M. In the case of zero modes, the determinant of M is zero, and 
the path integral would be ill-defined. Instead, we must leave out the zero mode in M, 
take the amputated determinant (denoted by det'), and integrate over the mode £o- By 
slightly changing some parameters in the action (for example by adding a small mass term) 
the zero mode turns into a non-zero mode, and then one needs y^d^o as measure. So, 
continuity fixes the measure for the zero modes as in (6.8). 

The next step is to convert the £o integral to an integral over the collective coordinate 
7 [56]. This can be done by inserting unity into the path integral. Consider the identity 

d7*(/(7))|£ , (6-10) 

which holds for any (invertible) function 7(7). Taking 7(7) = — (0— cl (7)|Z), and recalling 
that the original field is independent of 7, we get 

1 = j d7 - (Vqu 5 ((<M^)) = j d 7 - (Vqu ^ S (&«o) " 

(6.11) 

This trick is similar to the Faddeev-Popov trick for gauge fixing. In the semiclassical 



approximation, the term (0 



qu 



is subleading and we will neglect it 26 . The integration 



over £0 is now trivial and one obtains 

[d0] e" 5 = / d 7 A /^e" 5cl (det'M)" 1/2 . (6.12) 

J V 27T 

For a system with more zero modes Z % with norms-squared U % \ the result is 27 

J [d0] e" 5 = J J] (det ufPe-s* (del'M)^ 2 . (6.13) 



26 It will contribute however to a two-loop contribution. To see this, one first writes this term in the 
exponential, where it enters without %, so it is at least a one-loop effect. Then </> qu has a part proportional 
to the zero mode, which drops out by means of the delta function insertion. The other part of <^ qu is 
genuinely quantum and contains a power of % (which we have suppressed). Therefore, it contributes at 
two loops [57] (see also [58] for related matters). 

27 0ne obtains from (6.11) det (d^A^Z^) times (det U ab )-^ 2 . The matrix elements (d 7i Af\Z^) are 
equal to = U ij minus (D^A^lZ^). The latter term can be partially integrated, and vanishes 

since there are no boundary contributions, neither in the singular nor in the regular gauge. (For the regular 
gauge one needs an explicit calculation to check this statement.) 



44 



Notice that this result is invariant under rescalings of Z, which can be seen as rescalings 
of the collective coordinates. More generally, the matrix Uij can be interpreted as a metric 
on the moduli space of collective coordinates. The measure is then invariant under general 
coordinate transformations on the moduli space. 

One can repeat the analysis for gauge theories to show that (6.13) also holds for Yang- 
Mills instantons in singular gauge. For regular gauges, there are some complications due 
to the fact that neither of the two terms in (5.5) does fall off fast at infinity, but only 
their sum is convergent. In singular gauge, each term separately falls off fast at infinity. 
For this reason, it is more convenient to work in singular gauge. The measure for the 
bosonic collective coordinates for k — 1 SU(N) YM theories, without the determinant 
from integrating out the quantum fluctuations which will be analyzed in the next section, 
becomes 

2 4»+2 jr 4<*- 2 1 f j4 dp m 



This formula contains the square-root of the determinant of U in (5.26), 4iV factors of 
l/y/2n, and we have also integrated out the gauge orientation zero modes. This may be 
done only if we are evaluating gauge invariant correlation functions. The result of this 
integration follows from the volume of the coset space 



SU(N) \ 2 



)4iY-5_27V-2 



V01 ( SU(N - 2) x 17(1) J (N-l)\(N-2)\ ' (6 ' 15) 

which is a factor 2 4N ~ 5 larger than in [52, 1], because we have used the normalization 
tr(T a T fe ) = -\5ab, while in [52, 1] ti{T a T b ) = -25 ab was used. We found in (5.26) another 
factor 2~( 4Ar ~ 5 \ and indeed the result for the total measure in (6.14) is the same as in 
[52, 1]. The derivation of this formula can be found in Appendix C, which is a detailed 
version of [52]. 



6.2 The measure for the fermionic collective coordinates 

We must also construct the measure on the moduli space of fermionic collective coordinates. 
Consider (5.27). The fermionic zero modes are linear in the Grassmann parameters £ a and 
fj a >. Thus these £ Q and fj a i correspond to the coefficients £ a in (6.5). One obtains the zero 
modes by differentiating \ a in (5.27) w.r.t. £ a and fj a >, and for this reason one often calls 
these and f} a > the fermionic collective coordinates. This is not quite correct, because 
collective coordinates appear in the classical solution (the instanton) but we shall use 
this terminology nevertheless because it is common practice. We use again the measure 
in (6.8). There are in this case no factors ^== because of the Grassmann integration, 
and instead of (detM')^ 1 / 2 we now obtain (detM') 1 / 2 in (6.9). Because the parameters 
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^ a ,fj a , etc. appear linearly in the zero modes, we do not need the Faddeev-Popov trick 
to convert the integration over zero modes into an integration over collective coordinates. 
So for fermions the Grassmannian coefficients of the zero modes are at the same time the 
collective coordinates. 

We shall discuss these issues in more detail when we come to supersymmetric gauge 
theories, but now we turn to computing the norms of the fermionic zero modes. 
For the zero modes with £ in (5.27), one finds 

d\ a 

Z {P) = ~Q£P = ~2 a ^ a p F »v ■ (6.16) 
The norms of these two zero modes are given by 

(f^V = -~2 I d ^ tr { Z ^)Z a{l) } = 45 d V , (6.17) 
9 J 

where we have used the definition in (5.8) and contracted the spinor indices with the usual 
metric for spinors. This produces a term in the measure 28 

J dede (4^)^ . (6.i8) 

The result (6.18) actually holds for any k. We get the square root of the determinant 
in the denominator for fermions. One really gets the square root of the super determi- 
nant of the matrix of inner product, but because there is no mixing between bosonic and 
fermionic moduli, the superdeterminant factorizes into the bosonic determinant divided by 
the fermionic determinant. 

For the fj zero modes, we obtain, using some algebra for the a-matrices, 

Z"? = d\ a /df)f3> = l(a^a p ) af3 'F„„x p , (6.19) 

and 

{U n )J' = 8S cl 5j'p 2 , (6.20) 
so that the corresponding measure is 



/ 



d^ (Sp 2 ^)" 1 , (6.21) 



which only holds for k — 1. 

Finally we compute the Jacobian for the fermionic "gauge orientation" zero modes. 
For convenience, we take the solutions in regular gauge (the Jacobian is gauge invariant 
anyway), and find from (5.29) 

(Z? w) ) u =— 9 5 a v A u w , (Z? n ))"=—. 9 e au A w v , (6.22) 



28 Somctimcs one finds in the literature that U$ — 2S C \- This is true when one uses the conventions for 
Grassmann integration f d 2 £ £ a £^ = \e al3 '. In our conventions d 2 £ = d^d^ 2 . 
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where the N by N matrix A is the unity matrix in the (N — 2) by (N — 2) upper diagonal 
block, and zero elsewhere. So A restricts the values of u, w and v to up to N — 2 while in 
5 a v and e av the index runs over the next two values. Consequently, the norms of and 
Zp_ are easily seen to be zero, but the nonvanishing inner product is 

(U„)\ = -j 2 f d'xtr Z^ u) Z a{ ^ = 2 -^A\ , (6.23) 

where we have used the integral (B.21). It also follows from the index structure that the £ 
and fj zero modes are orthogonal to the \x zero modes, so there is no mixing in the Jacobian. 

Putting everything together, the fermionic part of the measure for M adjoint fermions 
coupled to SU(N) YM theory, with k — 1, is given by 

AT/AT \ . 2 , AT M /N-2 \ , 2 , ^(JV-2) 

(6.24) 

Similarly, one can include fermions in the fundamental representation, for which the Jaco- 
bian factor is 

U K = [ d 4 xZ a u Z a u = n 2 , (6.25) 




for each species. Here /C is the Grassmann collective coordinate of (5.36). Hence in this 
case the fermionic part of the measure is 

for Nf fundamental Weyl spinors coupled to SU(N) YM theory with k — 1. 

Note that we did not put a factor in front of the integral in (6.25), whereas we used 
such a factor for fermions in the adjoint representation. The reason we do not use such a 
factor for fermions in the fundamental representation has to do with the action. One finds a 
factor ^2 in front of the Yang-Mills action, and therefore also, by susy, in front of the Dirac 
action for gluinos. However, in the matter action the ^-dependence has been absorbed by 
the gluons, so there is no factor in front of the matter fermions. The measure of the zero 
modes uses the metric of the collective coordinates. In soliton physics (and instantons can 
be considered as solitons in one higher dimension) one obtains this metric if one lets the 
collective coordinates become time dependent and integrates over d 4 x in the action one 
ends up with a quantum mechanical action of the 

C = (f/%7, + Uapti* + U a ' p 'Va'h' + U AB k A K B (6.27) 

Since XJ % \ U a p and XJ a 'P' are produced by the Yang-Mills action and its susy partner, while 
Uab is due to the matter action, there is no (^-dependence in (6.26). 
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7 One loop determinants 



Having determined the measure on the moduli space of collective coordinates, we now com- 
pute the determinants that arise by Gaussian integration over the quantum fluctuations. 
Before doing so, we extend the model by adding real scalar fields and Majorana fermions 
in the adjoint representation. The action is 

S=-~ 2 j d 4 xtr {\F^F^ + {D^){Dp4>)-i\l?>\-i\I?>\} . (7.1) 

Here, A is a two-component Weyl spinor which we take in the adjoint representation 29 . In 
Minkowski space there is a reality condition between the two complex 2-component spinors 
A and A, and as a result Aq, transforms in the complex conjugate of the representation of 
A Q , but in Euclidean space this reality condition is dropped. So A Q and X a > are independent 
complex variabes. For the Grassmann integration this makes no difference. Written with 
indices the Euclidean Dirac action in (7.1) reads {— iX^a^) 0113 ' D^Xpr — iX a> (a^a/pD^X^} 
where X a = X^ep a and X a ' = e a '^'Xpi. Generalization to fundamental fermions is straight- 
forward. The anti-instanton solution around which we will expand is 

A*, 0d = O, A cl = 0, A cl = 0, (7.2) 

where A c ^ is the anti-instanton. This background represents an exact solution to the field 
equations. The bosonic and fermionic zero modes are taken care of by the measure for 
the collective coordinates, while in the orthogonal space of nonzero modes, one can define 
propagators and vertices, and perform perturbation theory around the (anti-) instanton. 

After expanding = A c ^ + A^ u , and similarly for the other fields, we add gauge fixing 
and ghost terms 

S g f = ~ J d*xtr{(D*A^) 2 -2bDlc} , (7.3) 

such that the total gauge field action is given by (5.3). The integration over A^ gives 

[det'A^f 172 , A„„ = —D 2 5 fiu - 2F flI/ , (7.4) 

where the prime stands for the amputated determinant, with zero eigenvalues left out. We 
have suppressed the subscript 'cl' and Lie algebra indices. Integration over the scalar fields 
results in 

[detA^r 1/2 , A<p = -D 2 , (7.5) 



29 As before A = ( - ) , but the 4-componcnt Majorana spinor A is denned by A T C both in Minkowski 

and in Euclidean space, where C is the charge conjugation matrix, C = ^ d/3 ^ ' Then A = 

(A a , — A Q ) and Lorentz (or rather SO{4)) invariance is preserved in Euclidean space because the relation 
C"f^ = ~^' T C holds in both spaces. In Euclidean space we denote the indices of A by a' instead of a. 
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and the ghost system yields similarly 

[detA gh ] , A gh = —D 2 . (7.6) 

For the fermions A and A, we need a bit more explanation. Since neither p nor fl is 
hermitean(even worse, p maps antichiral spinors into chiral spinors), we cannot evaluate 
the determinants in terms of their eigenvalues. But both products 

A- = — jp jp = —D 2 — \a^ v , A + = — jp jp = —D 2 , (7.7) 

with spinor indices still suppressed, are hermitean. Let us label the nonzero modes by a 
subscript i. Then we can expand A in terms of commuting eigenfunctions F of A_ with 
anticommuting coefficients £j, and A in terms of eigenfunctions F of A + with coefficients £j. 
We have seen before that both operators have the same spectrum of non-zero eigenvalues 
ej, and the relation between the eigenfunctions is F = -^=pFi and Fj = -^=pFi. (The 
minus sign is needed in order that F = ~^P^i = ~t(~PP)Fi = Fi)- Defining the path 
integral over A and A as the integration over £j and £j, one gets the determinant over the 
nonzero eigenvalues 30 . The result for the integration over the fermions can be written in 
symmetrized form as 

[det'A_] 1/4 [detA + ] 1/4 . (7.9) 

As stated before, since all the eigenvalues of both A_ and A + are the same, the determi- 
nants are formally equal. This result can also be obtained by writing the spinors in terms 
of Dirac fermions; the determinant we have to compute is then 

[det'Ay 1 ' 2 , * B =(l*). (7.10) 



30 Namely, the action becomes 

ft. 



-i ( liFi Mj y - itijFj IX, 



r 



Next we use that the norms of Fj and Fj are equal: 



(7.8) 



= -^=(m\^=pF^ = -^=(Fi\-^m) 

v e « \ V 3 I V e » 3 



Hence, as expected, the Fj and Fj for different eigenvalues are orthogonal to each other, and the norms of 
F; and Fj are the same. Denoting ^ J d 4 x(Ff)*F 1 a =< FfjFf > by m, one finds for the path integral 



/ 



Hence the measure is 4$=^=, and the one-loop determinant is TLfe) 1 ^ 2 - 
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One would expect that in a supersymmetric model with vectors, spinors and scalars, 
the sum of all zero point energies cancel. These zero point energies correspond to the 
one-loop determinants in an external Yang-Mills field. So this suggests that all one-loop 
determinants are related, and since the one-loop determinants of fermions depend on A + 
and A_, one would expect that the determinants for the bosons can be expressed in terms 
of the determinants of A_ and A + . For the ghosts and adjoint scalars this is obvious, 



det A^ = det A g h = [det A H 



,1/2 



(7.11) 



We get det A^ = det(-L> 2 ) = det A+ /2 and det A gh = det(-L> 2 ) = det A 1 / 2 because the 
spinor space is two-dimensional. 

For the vector fields, we rewrite the operator A pv in (7.4) in terms of the fermion operator 
A_. Using tr(0- M ov) = 2<5 M „ and ti(a p a pa a u ) = 2(5 w S au - S pa 5 pu - e ppav ) we obtain the 
following identity for A pv = -5 pu D 2 - 2F flv , 

A^ = lti{a p A_a u } = la pa , f3 (AJ^)a2 a ' 
= |(^)(A_V a V)(^ 75 ') 



(7.12) 



where (A_)^5 a ' s > is block-diagonal on the basis f3a' = 7<5' = (11), (21), (12), (22). 



A_ 2 2 



/ A_ x i AJ 2 
A_\ 



V o o 






A. 2 , 






A_ 2 2 



(7.13) 



This proves that 
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det'A 



flU 



[det'A. 



(7.14) 



Now we can put everything together. The one-loop determinant for a Yang-Mills system, 
including the ghosts, coupled to n real adjoint scalars and Af Weyl spinors (or Majorana 
spinors) also in the adjoint representation is 



[det'A. 



-l+Af/4 



[detA+] 



-,{2+N-n) 



This expression simplifies to the ratio of the determinants when Af — 
cases are 

3/4 



(7.15) 
Particular 



M = 1 n = 



det A.< 
det'A. 



31 Consider a p ^ a ip and a^ s as 4 x 4 matrices. Then on the right-hand side of (7.12) one has the product 
of three 4x4 matrices. For fixed fi and v one has o^tpof* — 2<^", hence det^,^] = 4. 
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M = 2 n = 2 



detA+1 1/2 
det'A_ 



N = 4 n = 6 



dct A 



(7.16) 



dct'A_ 



These cases correspond to supersymmetric Yang-Mills theories with A/"-extended super- 
symmetry. Notice that for M = 4, the determinants of A + and A_ separately cancel, so 
there is no one-loop contribution. 

For J\f = 1, 2 the determinants formally give unity since the non-zero eigenvalues are the 
same. However, one must first regularize the theory to define the determinants properly. 
After regularization, the renormalization procedure must be carried out and counterterms 
must be added. The counterterms are the same as in the theory without instantons and 
their finite as well as infinite parts must be specified by physical renormalization conditions. 
The ratios of products of non-zero eigenvalues can be written as the exponent of the 
difference of two infinite sums 



with eigenvalues A n = expco n . The frequencies uon and uon can be discretized by putting 
the system in a box of size R and imposing suitable boundary conditions on the quantum 
fields at R (for example, 4>(R) = 0, or -^4>(R) = 0, or a combination thereof [4]). These 
boundary conditions may be different for different fields. The sums over Urt^ and Un ^ are 
divergent; their difference is still divergent (although less divergent than each sum sepa- 
rately) but after adding counterterms AS* one obtains a finite answer. The problem is that 
one can combine the terms in both series in different ways, giving different answers. By 
combining oo^ with u)n ^ for each fixed n, one would find that the ratio (det A + / det' A_) 
equals unity. However, other values could result by using different ways to regulate these 
sums. We have discussed before that for susy instantons the densities of nonzero modes 
are equal, hence for susy instantons the contributions in (7.16) from the one-loop deter- 
minants cancel. This makes these models simpler to deal with than non-susy models. For 
ordinary (nonsusy) Yang-Mills theory, the results for the effective action due to different 
regularization schemes differ at most by a local finite counterterm. In the background field 
formalism we are using, this counterterm must be background gauge invariant, and since 
we consider only vacuum expectation values of the effective action, only one candidate is 
possible: it is proportional to the gauge action J d 4 xtrF 2 and multiplied by the one-loop 
beta-function for the various fields which can run in the loop, 




(7.17) 




(7.18) 
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" det'A_ " 


-i 


det A+ 


_detA (0) _ 




_detAf_ 



The factor In (p 2 / pi) parametrizes the freedom in choosing different renormalization schemes. 

A particular regularization scheme used in [4] is Pauli-Villars regularization. In this 
case 't Hooft first used x-dependent regulator masses to compute the ratios of the one-loop 
determinants A in the instanton background and A^ ) in the trivial vacuum. Then he 
argued that the difference between using the x-dependent masses and using the more usual 
constant masses, was of the form AS* given above. The final result for pure YM SU (N) in 
the \k — 1| sector is [4, 52] 

1/2 

= exp { §iV \n(pp) - a(l) - 2(N - 2) a (§) } . (7.19) 

Here we have normalized the determinants against the vacuum, indicated by the superscript 
(0). From the unregularized zero mode sector one obtains a factor p AN , see (6.14), and 
Pauli-Villars regularization of the 4N zero modes yields a factor Mpy. All together one 
obtains ^ + y ln(Mpyp) in the exponent for SU(2), where go is the unrenormalized 
coupling constant. Subtracting y In(MpvPo) to renormalize at mass scale 1/po, one is left 
for the effective action with ^ — y ln(p/po) = p^y- Replacing ln(p/po) by ln(/x//xo), this 
is the correct one- loop renormalization equation for the running of the coupling constant. 
For supersymmetric theories, the nonzero mode corrections to the effective action cancel, 
and performing the same renormalization procedure as for the non-supersymmetric case, 
one now obtains only from the zero modes the correct (3 function. For M = 4 one finds a 
vanishing /3 function. 

The fluctuations of the SU (2) part of the gauge fields and the Faddeev- Popov ghosts 
yield the term a(l) in (7.19), while the fluctuations of the 2(iV — 2) doublets (corresponding 
to A 4 , • • • A 7 for 577(3)) yields the term with a (|). The numerical values of the function 
a(t) are related to the Riemann zeta function, and take the values a (|) = 0.145873 and 
a(l) = 0.443307. Notice that this expression for the determinant depends on p, and 
therefore changes the p-dependence of the integrand of the collective coordinate measure. 
Combined with (6.14) one correctly reproduces the /3-function of SU(N) YM theory. The 
calculation of the contribution of the nonzero modes can be simplified by using a so-called 
0(5) formalism [59] which uses the conformal symmetries of instantons, in addition to 
the nonconformal symmetries. One still has to regulate the sums over zero-point energies, 
and both Pauli-Villars regularization [59] and zeta-function regularization [60] have been 
applied to the 0(5) formulation. 

7.1 The exact (3 function for SYM theories 

In supersymmetric gauge theories, the contributions to the one-loop partition function by 
the nonzero modes in the bosonic and fermionic loops cancel each other [61]. Although this 



52 



has only been shown to occur in a gravitational background without winding, we assume 
here that still occurs in an instanton background. Actually, all contributions from the 
nonzero mode sector cancel: higher-loops as well as possible nonperturbative corrections. 
The zero mode sector can be regularized by Pauli-Villars fields, and since the partition 
function yields a physical observable, namely the cosmological constant (the sum over zero- 
point energies), the result for the partition function should not depend on the regularization 
parameter M PV (the Pauli-Villars mass). From this observation one can derive a differential 
equation for the coupling constant g(M PV ), which yields the exact (3 function: it contains 
all perturbative contributions [62]. 

Before going on we should comment on the fact that from the 3-loop level on the result 
for the /3 function depends on the regularization scheme chosen. It is sometimes claimed 
that therefore higher-loop results for the (3 function have no meaning. This is incorrect: 
given a particular scheme, all orders in perturbation theory of (3 have meaning. In the 
derivation below of the (3 function we shall find an all-order result, but it is not (yet?) 
known which regularization scheme for Feynman graphs would reproduce these results. So 
the all-order expression for (3 has in principle meaning, but in practice one cannot do much 
with it. One can only say: there must exist a regularization scheme which, if used for the 
calculation of higher-loop Feynman graphs, will produce the all-order result for (3 obtained 
below. 

We begin with pure supersymmetric gauge theory. We recall that the measure of the 
zero modes of a single instanton or anti-instanton (k = ±1) for N = 1 susy with gauge 
group SU(N) and one Majorana or Weyl fermion in the adjoint representation is given by 



dM k -_ 



-±i 



d xd P_ 2 4N+2 I P\ I M PV \ 7T 4iV 



P 5 \gj \V^J (W-l)!(JV-2)! 



X 



d£id£ 2 dr]idr] 2 Y[^ = ? dp u dp u 

, JV-2 



v °' W-TW (7 - 20) 



AS d M PV 8p 2 S cl M PV (lS cl M Pv y 

where S c \ = 8n 2 /g 2 . The volume of the gauge group was given in (6.15) but because 
it does not depend on g or M PV it will play no role below. Note that this measure is 
dimensionless; d 4 xdp/p 5 is dimensionless, and the remaining p and M PV occur only in the 
combination pM PV . Also d 2 ^/M PV and d 2 fj/(p 2 M PV ) are dimensionless. The prefactor 
e -sn 2 / g 2 j g Q f course the classical action for the one-instanton background, and we have left 
out the term with the term with the theta-angle. In the first square brackets we find the 
product of the measure for the bosonic zero modes in (6.14) and factors \J for each 
bosonic zero mode from the corresponding Pauli-Villars modes. 32 The second expression 
in square brackets contains the contribution to the measure from the fermionic zero modes 



:S2 



If the one-loop determinant for the bosonic fields is (detM;,) 1 / 2 and for the fermionic fields detM/, 
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given by (6.24), with factors for each fermionic zero mode. Clearly, each bosonic 

zero mode contributes a factor Mpy/g and each fermionic zero mode contributes a factor 

The dependence of dM on M PV and g is thus as follows 

2 / -I \ 2N 

dM oc e~^{M PV f N (-) , (7.21) 

where g depends on Mpy, so g = g(Mpy). So g is the bare coupling constant in the 
regularized theory, and g and Mpy vary such that the renormalized coupling constant g R 
is kept fixed. Usually one considers the renormalized coupling constant as a function of 
the renormalization mass fj,, and then the bare coupling constant g satisfies ^§^g = 0. 
Using dimensional regularization and g = Z g (g ren )g ren jj, e / 2 with e = 4 — n yields then the 
(3 function. If one uses Pauli-Villars regularization there are two masses which play a 
role: the cut-off (regulator) mass M PV and the physical renormalization mass /i. The bare 
coupling depends on one of them, the renormalized coupling on the other. 

g = g(Mp V ) g R = g R (fi) 



Mp V7 ^g(M PV ) = (3(g) ptd/dpt g R (fi) = f3(g R ) (7.22) 
a 



dM PV ' 

fi£g(M PV ) = Mpv^-gM = 



*dniJ\^rv ) - « xv± rv QMpv . 

Physical quantities depend on \x but not on Mpy. If one wants to apply the renormalization 
group to the measure, one must use the approach based on (Mpyd/dMpy)g R (fi) 
= because the regularized measure depends on Mpy, not on /i. The results for the (3 
function obtained from both schemes differ by a sign, because the logarithms in the theory 
depend on \n(Mpy / fi). 

Equating the derivative of the logarithm of the measure w.r.t. in (7.21) Mpy to zero 
yields then 

M PVJ^T- (~ + 3N\nM P y-2N\ng) = . (7.23) 
oMpy \ g 2 J 



Hence 



^W^Hi^W 1 ' (7 ' 24) 



3 



2 



or, written in terms of a — J 



JLp = Mpy — a = w . 7.25 

2vr v dMpy 2f 1-^ K ' 



then the Pauli-Villars method yields further determinants det(M(, + Mp V ) +1 ' 2 and det(M/ + Mpy)^ 1 . 
The zero modes arc cigcnfunctions of Mb and Mf with eigenvalue zero, so their Pauli-Villars counterparts 
become nonzero modes with eigenvalues Mp V and Mpy. 
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This is the /3-function for pure A~ = 1 supersymmetric Yang-Mills theory. It is straightfor- 
ward to extend this result to pure A/"-extended supersymmetry with Af Majorana or Weyl 
fermions in the adjoint representation. One finds for SU (N) 

d a 2 AN — AfN 

MpV dM^ a = -2-.l-g(2N-NN) ■ (7 ' 26) 

It is clear that for Af = 2 there is only a one-loop contribution to /3, and for Af = 4 
the f3 function vanishes altogether. These are well-known properties of pure extended susy 
gauge theories. For Af = 1 one finds agreement for one- and two- loops. Beyond two loops 
the result for the beta function becomes scheme dependent, so it becomes then pointless 
to investigate whether agreement holds. 

Let us now add matter. In susy QCD with Nf flavours the matter part consists of Nf 
pairs of chiral superfields Q % and Qi with % = l,Nf in the N_ and N_* representations of 
SU(N). Each fermion in Q and Q has one zero mode, see (4.37) and (5.35), while the 
scalars do not have any zero modes. So the zero mode measure for the matter part is 
according to (6.26) 

/ l\ 2N f 1 Nf 
dM (matter) = ^-J ^—^ ^dK u dK u . (7.27) 

Renormalization leads to a further term in the measure, and thus in the f3 function. In 
susy only the kinetic term <j)e v <j) of the matter fields gets a Z factor 

C = Z0 ren e yren rell , = V^ren • (7.28) 

and rather than a factor for each fermion with one flavor, we now get in the measure 

a factor (ZMpy)~ 1 / 2 for each zero mode. (The Pauli-Villars field operator becomes ZMf + 

— 1/2 

Mpv, so the zero modes continue to produce a factor M PV in the Pauli-Villars sector, 
not {ZMpy)~ x l 2 . In the nonzero mode sector one can neglect the dependence on Mpy, 
and here the Z factors of bosons and fermions cancel due to susy) . 

For the gauge multiplet we factorized out a factor 1/g 2 in front of the action of all fields 
of the gauge multiplet, so that the fields gA^ = do not renormalize. (We use here the 

— 1/2 

background formalism in which Z g = Z A , where Z& is the wave function renormalization 
constant for the background fields.) Thus the renormalization of the gauge multiplet is 
taken care of by the renormalization of the factor 1/g 2 in (7.23). 

From here on we proceed as before. The measure for gauge group SU(N) with Nf 
flavors is now given by 

^ ±1 =e-^ (Mpi , r (i) (_y (7 . 29) 
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We denote the anomalous dimension by 7« where 33 

8 , 7 



d 

-M PV lnZ* = 7 (the same for i = 1, N f ) , (7.30) 

oM-- 



PV 



and obtain 



9 n ,„ d a 2 (3N-N f (l- ,, , 

—13 = M PV — a = } \ 7 — - . 7.31 

2vr v dMp V 2tt ^ 1 - ^ 1 

Expanding in terms of a, the result agrees with the results in the literature for the one-loop 
and two-loop (3 functions for iV = 1 susy QCD with Nf pairs of chiral fields Q l and Q, L . 
Namely, the one- and two-loop (3 function for an iV = 1 vector multiplet coupled to a 
chiral multiplet in a representation R, including the effects of the Yukawa couplings whose 
coupling constant is also g (in fact, the renormalized coupling constant gp), is given by 
[64] 

a 3 

+ — (-6C 2 2 (G) + 2C 2 {G)T{R) + AC 2 {R)T{R)) (7.32) 

For N f pairs of chiral matter fields E T (^) = N f, and C 2 (G) = N for SU(N). Using 
also that the anomalous dimension 7 = jj,^- \nZ for a complex fermion in the fundamental 
representation N of SU(N) is equal to — aC 2 (R)/Tc, we indeed find agreement. 34 

The f3 function in (7.31) can be rewritten such that only the numbers of zero modes 
appear. 

)9(«) = -^^-in/-iE^ + iET/) • ( 7 - 33 ) 

Here n g is the number of bosonic zero modes (AN),nf the total (gluino and matter) number 
of fermionic zero modes (2iV + 2iVj), and the sums J2 g and run over the gluon and 
fermion zero modes. For gluons and gluinos A, the anomalous dimension is the same (due 
to susy) and proportional to the (3 function 



lg = 7 A = P/a . (7.34) 

Substitution of this result yields back (7.31). This result does not yet agree with the 
results in the literature for the /9-function of gauge fields minimally coupled to scalars and 



33 0ften one defines 7 = ji-S- In \[Z\ here we follow [62] 



34 With the usual normalization 7 = (i-J^ In Z 1 / 2 is equal to 7 = =^C2(-R) [64]. 
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fermions, because in supersymmetry the Yukawa couplings between scalars and fermions 
have not an independent coupling constant A but rather A = g 2 . At the two- loop level one 
therefore gets extra contributions which one must add to the results from the literature, 
and then one gets complete agreement. 

8 J\f = 4 supersymmetric Yang-Mills theory 

An interesting field theory with instantons is the M = 4 super Yang-Mills theory [65]. The 
action is of course well known in Minkowski space, but instantons require the formulation 
in M = 4 Euclidean space. Due to absence of a real representation of Dirac matrices in 
four-dimensional Euclidean space, one cannot straightforwardly define Majorana spinors 
in Euclidean space. This complicates the construction of Euclidean Lagrangians for su- 
persymmetric models [66, 67, 68]. For M = 2,4 theories, one can replace the Majorana 
condition by the so-called symplectic Majorana condition and then one can define (sym- 
plectic) Majorana spinors in Euclidean space. Equivalently, one can work with complex 
(Dirac) spinors [69, 70]. In the following subsection we write down the action in Minkowski 
space-time and discuss the reality conditions on the fields. Next we construct the hermitean 
M = 4 Euclidean model via the dimensional reduction of ten-dimensional M = 1 super 
Yang-Mills theory along the time direction. One can also define a continuous Wick rotation 
for the spinors directly in four dimensions [68]. 

8.1 Minkowskian M = 4 SYM 

The M = 4 action in Minkowski space-time with the signature rf v = diag(— , +, +, +) is 
given by 

s = j 2 Jd A xtx ^-f^ - fxx p^ A - %x A p^x A$ + l - (D^ab) (d^ ab ) 

- V20 AB {\ a ' A , Af } - V2<f> AB {X% Aq,,_b} + I [<j> AB , CD ] [4>ab, 0cd] } • (8.1) 

The on-shell M = 4 supermultiplet consists of a real gauge field A^, four complex Weyl 
spinors \ a ' A (equivalently, four Majorana spinors) and an antisymmetric complex scalar 
4> AB with labels A, B = 1, ... ,4 of the internal R symmetry group SU(4:). The reality 
conditions on the components of this multiplet are 35 the Majorana conditions [\ a,A )* = 
-XX and (X A )* = X & ,a and 

4>ab = (O* = ¥abcd<P CD ■ (8.2) 

35 Unless specified otherwise, equations which involve complex conjugation of fields will be understood 
as not Lie algebra valued, i.e. they hold for the components X a ' a - A , etc. 
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These conditions are invariant under S77(4) transformations. The sigma matrices are 
defined by a^ af3 = (l,r l ), = (— l,r l ) for /j, — 0, 1,2,3 and complex conjugation gives 

(tr^y = <rj d = = e^e^^, with = = -e Q/3 = -e a/9 . Since AB is 
antisymmetric, one can express it on a basis spanned by the real eta-matrices (see Appendix 
B) 

<P AB = ^= {SV AB + iP'ff AB } , 4>ab = ^ {SSfc* - iPS^} , (8.3) 

in terms of real scalars S l and real pseudoscalars P\ i = 1,2,3. Because rf AB is self dual 
and fj lAB anti-selfdual, rf AB = rfAB and ff AB = —rfAB- Then the reality conditions are 
fulfilled and the kinetic terms for the (S, P) fields take the standard form. The action in 
(8.1) is invariant under the following supersymmetry transformation laws with parameters 

(a and (aA 

5 A, = -iQa^' A + i\ $>A ofc£ , 

s<p ab = V2(c> A \ B - r B \ A + , abcd QK,d) , 

5\ a ' A = -\^ V %F^ A - iyfiCw TH AB + \_4 >AB i ^bc] C' c , (8.4) 

which are consistent with the reality conditions. Let us turn now to the discussion of the 
Euclidean version of this model and discuss the differences with the Minkowski theory. 



8.2 Euclidean M = 4 SYM 

To find out the M = 4 supersymmetric YM model in Euclidean d = (4, 0) space, we 
follow the same procedure as in [65]. We start with the Af = 1 SYM model in d — (9, 1) 
Minkowski space-time, but contrary to the original papers we reduce it on a six-torus 
with one time and five space coordinates [69, 70]. As opposed to the action in (8.1) with 
the SU(A) = SO (6) i?-symmetry group, this reduction leads to a model with an internal 
non-compact 5*0(5, 1) i?-symmetry group in Euclidean space. As we will see, the reality 
conditions on bosons and fermions will both use an internal metric for this non-compact 
internal symmetry group. 

The Af = 1 Lagrangian in d — (9, 1) dimensions reads 

Cio = \-\*{\F MN F MN + *r M D M V) , (8.5) 
9w 

with the field strength Fmn = du An — 9nAm + [Am, An] and the Majorana-Weyl spinor 
W defined by the conditions 

r ii# = # ? & T C W = &Ur° = V . (8.6) 
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Here the hermitean matrix r 11 = *T is a product of all Dirac matrices, -T 11 = T° . . . r 9 , 
normalized to (*r) 2 = +1. Furthermore, is the charge conjugation matrix, satisfying 
C~r M = -r?C w . The T-matrices obey the Clifford algebra {r M ,T N } = 2t] mn with 
metric r] MN = diag(— , +,...,+). The Lagrangian transforms into a total derivative under 
the standard transformation rules 36 

5A M = tr M v , s# = -\F MN r MN c , (8.7) 

with r MN = \[r M r N - r M r N }. The susy parameter is a Majorana-Weyl spinor, ( = 
( T C W = (Hr and *r£ = (. To proceed with the dimensional reduction we choose a 
particular representation of the gamma matrices in d — (9, 1), namely 

r M = {r ® 7 5 , i [8 ]x[8] ® 7 M } , r 11 = r° . . . r 9 = 7 7 ® 7 5 , (8.8) 

where the 8x8 Dirac matrices 7° and 7 7 of d = (5, 1) with a = 1, . . . , 6 can be conveniently 
defined by means of 't Hooft symbols as follows 





T= =„ n h 7 7 =7 1 ...7 b = I (8.9) 



In Euclidean d = (6, 0) one defines 

r°' AS = (?7 feAB ,i77 fe ' AB ) 
^as = (-V k AB,ifj k A B ) (8.10) 

but in Minkowski space one puts a factor —i in front of the first one. So explicitly i7 a > AB = 

^^AB ^AB ^AB ^AB^ ^ = {ir) AB , -T) AB , -T) AB , ifj k AB } SO that \e A BC D S aCD = 

—S AB . The first three matrices 7 1 ,7 2 ,7 3 are symmetric while the latter three matrices 
7 4 , 7 5 , 7 6 are antisymmetric. Meanwhile 7^ and 7 5 are the usual Dirac matrices of d — (4, 0) 
introduced in (4.15). Note that in this construction we implicitly associated one of the 
Dirac matrices, namely 7 1 , in 6 dimensions with the time direction and thus it is anti- 
hermitean and has square — 1; all others (as well as all Dirac matrices in d — (4,0)) are 
again hermitean with square +1. Let us briefly discuss the charge conjugation matrices in 
d — (9, 1), d — (5, 1) and d = (4, 0). One can prove by means of finite group theory [71] 
that all their properties are representation independent. In general there are two charge 
conjugation matrices C + and C~ in even dimensions, satisfying C ± T^ = ± (-T M ) C ± , and 



36 After partial integrations, the Yang-Mills action transforms into [(rNip a )DMF a,MN and the Dirac 
action varies into -$ a r M ' {-\r p ® 'D M F PQ Q. The sum of these two variations cancels if one uses the 
Bianchi identity D[ M F P q] — 0. The variation of Am in the covariant derivative in the Dirac action cancels 
separately due to the 3-spinor identity ('4> a r M 'tp h )((r M 'tp c )f abc = which holds in 3,4,6 and 10 dimensions. 
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C + = C~ *r. These charge conjugation matrices do not depend on the signature of space- 
time and obey the relation C~ *T = ± (*r) T C~ with — sign in d — 10, 6 and + sign 
in d — 4. The transposition depends on the dimension and leads to (C ± ) T = ±C ± in 
d = 10, (C ± ) T = in d = 6, and finally (C±) T = -C ± for d = 4. Explicitly, the charge 
conjugation matrix C{ is given by Cq <E> 07 where 

4 = 77 = ( o ^ j' 6 =l<11 = {s^ o J • (8 ' u) 

Upon compactification to Euclidean d = (4, 0) space, the 10-dimensional Lorentz group 
50(9, 1) reduces to 5*0(4) x 50(5, 1) with compact space-time group 50(4) and R- 
symmetry group 50(5, 1). In these conventions a Weyl spinor \P in ten dimensions with 
16 (complex) nonvanishing components decomposes as follows into 8 and 4 component 
chiral-chiral and antichiral-antichiral spinors 

or more explicitly 

ij T = [(A al ,0);(A a2 ,0);(A a3 ,0);(A a4 ,0);(0,0);(0,0);(0,0),(0,0)] 

+ [(0, 0); (0, 0); (0, 0); (0, 0); (OA^); (0, A Q , 2 ); (0, A Q , 3 ); (0, V 4 )] ■ (8.13) 

Here \ a > A (a = 1, 2) transforms only under the first SU(2) in 50(4) = SU{2) x SU(2), while 
\ a ',A changes only under the second 577(2). Furthermore, \ a ',A transforms in the complex 
conjugate of the 50(5, 1) representation of \ a ' A . To understand this latter statement, 
note that the 50(5,1) generators are M ab = ^(7 a 7 6 — 7 6 7 a ), and 7 1 is antihermitian. 
Furthermore, 7 1 ,7 4 ,7 5 and 7 s are purely imaginary. Thus 

( 7 a )* = -SfS- 1 ; 5 = 7 2 7 3 . (8.14) 

This matrix 5 is not the charge conjugation matrix. The Lorentz generators M ab and 5 
are block diagonal 

where A7 ab = l(A7 a A7 6 - E b IJ a ) and E ab = \{E a E h - E h E a ) while 5 = -tfrf = rj 1 . It 
follows that 

s^s- 1 = -{s a y =>■ su^s- 1 = (u ab y , 
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SE^' 1 = -(E b )* SE^S' 1 = (E ab )* . (8.16) 

Thus the two spinor representations of 5*0(5, 1) are each pseudoreal (they are not real 
since S is antisymmetric), but they are not equivalent to each other. For SO (6) ~ SU(A), 
the two spinor representations are of course complex and inequivalent to each other. For 
S0(3, 1) the opposite is the case: there the two spinor representations are complex, and 
equivalent to each other under complex conjugation, (<x M „)* = a-2a tJlV a 2 because S = 72 is 
off-diagonal. 

Substituting these results, the Lagrangian reduces to 
C% =A = ^tr I \F, V F, V - f\i p a ,^ A - i\£ fl a(3 '\f,,A + \ {D^ab) {D^ ab ) 

- sT^ab {X a ' A , Af } - V2J> AB X a ,, B } + § [<j> AB , <P CD ] $ AB , 4>cd\ 

(8.17) 

where we still use the definition § A b = ^abcd(P CD ■ These scalars come from the ten- 
dimensional gauge field, and can be grouped into <p AB = -^E aAB A a , where A a are the first 
six real components of the ten dimensional gauge field Am- Using rf h = {^E aAB , -^E AB } 
with i] ab = (—1, +1, +1, +1, +1, +1) the vector indices are turned into SU(4:) indices. Writ- 
ing the action in terms of the 6 scalars A a , one of these fields, say Aq, has a different sign 
in the kinetic term, which reflects the S0(5, 1) symmetry of the theory. In the basis with 
the 4> AB fields, we obtain formally the same action for the Minkowski case by reducing on 
a torus with 6 space coordinates, but the difference hides in the reality conditions which 
we will discuss in the next subsection. The action is invariant under the dimensionally 
reduced supersymmetry transformation rules 

5 A, = -Ki^pX^ + iX^afd , 
H AB = V2 (C' A X B - C' B X A + e ABCD Q'X a ,, D ) , 
6X a > A = -\a» v %F^ A - iV2( a ,, B r a '<t> AB + C'° , 

$K>,A = -\^ U /F^ A + tV2C' B Pa'a^AB + [4>AB, L> ,C ■ (8.18) 

Again, these rules are formally the same as in (8.4). Note that the indices A, B are lowered 
by complex conjugation, but the spinor indices a and a' are lowered by e- symbols. 

8.3 Involution in Euclidean space 

The Majorana-Weyl condition (8.6) on & leads in four-dimensional Euclidean space to 
reality conditions on A" which are independent of those on X a r, namely, 

(X a > A Y = -X^ B e, aVBA , (X a ,, A y = -V^'V'^ • (8.19) 
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These reality conditions are consistent and define a symplectic Majorana spinor in Eu- 
clidean space. The SU (2) x SU (2) covariance of (8.19) is obvious from the pseudoreality of 
the 2 of SU(2), but covariance under 573(5, 1) can also be checked (use [r] a ,fj b ] = 0). Since 
the first £ matrix has an extra factor i in order that (r ) 2 = —1, see (8.8), the reality 
condition on <j) AB involves r\\ B 

(0* = ^««- (8-20) 

The Euclidean action in (8.17) is hermitean under the reality conditions in (8.19) and 
(8.20). For the a-matrices, we have under complex conjugation 

(<')* = W > (W (8-21) 

Due to the nature of the Lorentz group the involution cannot change one type of indices 
into another, as opposed to the Minkowskian case. 



9 Large instantons and the Higgs effect 

We have seen in previous sections that the instanton measure on the moduli space for 
pure 577(2) gauge theory with one anti-instanton {k — — 1) is given by (dropping overall 
multiplicative factors of two and n) 

dM oc d^^M% v ^ = ( ^) e -$«W,vr . (9 . 1} 

g 8 fV ^ p5g8 J 

The one-loop corrections coming from the determinants further modify the factor 4 into 
4 — | = y , see (7.19), and in addition yield some constants in the exponent. The integral 
over p, the instanton size, is clearly nonsingular for small p as long as asymptotic freedom 
holds 37 , but for large p it diverges severely. However, in a Higgs model, the mass term for 
the gauge bosons (C = — \A 2 l g 2 v 2 if there are no instantons) yields further terms of the 
form 

(Higgs) = ~2ir 2 v 2 p 2 + ... . (9.2) 

Thus for spontaneously broken gauge theories the p integral acquires a Gaussian cut-off, 
and yields a finite result. This solves the large-p problem for the electroweak interactions. 
For QCD the situation is more complicated; in fact, the large-p problem is presumably 
intimately related to confinement. We now give some details. 

37 0ne integrates p up to the renormalization scale /x, and instantons with scale p yield the prefac- 
tor cxp(— 8ir 2 /g 2 ). The g 2 in this prefactor depends on p, not on p. One finds then that g 2 (p) = 

87r 2 /(— Pi ln(pA)) where j3\ = —^Ci(G) H is negative if asymptotic freedom holds. So, if —j3\ + 3 > 0, 

there is no singularity at p = 0. 
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The Higgs action for an SU (2) Higgs doublet is given by 
C H = D^Dtf + \{y*ip - v 2 ) 2 



With < (p° >= v, the ordinary Higgs effect in Minkowski space gives a mass term C = 
— ^A^v 2 for the vector bosons 38 . We could also discuss other representations for the Higgs 
field but the analysis is very similar, and a doublet is of course the most interesting case. 
In Euclidean space we take for A^ the regular selfdual instanton solution with k — 1 

^ = -4^2" (9-4) 
x l + p l 

We next solve the ip field equation in this instanton background. For general A, an exact 
solution to the coupled equations seems out of reach. We therefore drop the potential 
term and only require that \<p\ — > |i>| at large |x| 39 . So this is not an exact solution, but 
the first term in an approximate solution. We shall discuss the higher-order terms later. 
As we shall show, the solution to the equation D^D^ip = 0, \<p(\x\ — > oo)| = v is of the 

form (p = f(r 2 ) I 4 ^ 3 ) . This clearly looks awkward, and a more covariant way to 
\ x + ix J 




^(^I- ( ^k;i. (9. 5 ) 



construct the solution is to write ip as 

fo,+) 

and to make the ansatz 

= vf{x 2 ) (a^/V**) ^ J j , (9.6) 

with f(x 2 ) — > 1 as x 2 — > oo. (Recall that one can always write (^„) as ^75(0" + ix ' <^)(o) 
and this yields the form of ip given in (9.5) up to an inessential factor 1). 

The function f(x 2 ) satisfies a second-order differential equation, but we do not analyze 
this equation, but present the result and check that it solves D^D^ip = 0: 

^ = -\[^Q^ ( 1 )= -r^—W ( 1 ) ■ (9-7) 

38 0ne usually decomposes tp° into ip° = — «X3)i see below, with < a >= v a . Then v 2 = \v 2 a , and 

the mass of the vector boson is rriA — \gv. 

39 This is 't Hooft's approach [4]. Note that the field equation for is not restricted due to the 
backreaction of the Higgs field. Affleck [72] considered instead the case v = 0, A arbitrary, in which case 
the usual instanton solution together with ip — solves the coupled equations. Both approached yield 
equivalent results. 
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The boundary condition is clearly satisfied because ^=f Q has unit norm. It is straight- 
forward to check that this expression for if satisfies the field equation. Namely, omitting 
the overall factor v and the spinor Q one finds 

n a . a^(x 2 + p 2 ) -x^x u a u (a^ u x u )(a p x p ) 



(x + p 2 f' 2 (x 2 + p 2 fl 2 

= a,p 2 /(x 2 + p 2 fl 2 

D,D,p = d,D,p + A^p = " 3 ^;^ } ^ 2 = • (9.8) 

Having found the solution of the field equation of the Higgs scalar in the background of 
an instanton, we now substitute it into the action to find the corrections to the classical 
action. The kinetic term only yields a surface integral due to partial integration 



J D^Dptp d 4 x = J dn^D^) 



T 



hm 2vrV) 3/ V 1 | ? vXv A= , o T&T L, 2 1 I =2vrVp 2 
J y/x 2 + p 2 V^ 2 (x 2 + p 2 fl 2 \ ( 



(9.9) 



This is the extra term mentioned in (9.2). 

However, the contribution of the term with the potential is divergent 



2 2 \ 2 

A J {ip* ip - v 2 ) 2 d 4 x = \J ( d 4 x = oo . (9.10) 

The reason for this divergence is clear: we did not solve the full field equation, but rather 
took the instanton solution of pure Yang-Mills theory, and solved the field equation for the 
scalar in this background, omitting the potential term. 

We enter here the difficult area of "constrained instantons" [72, 73]. There does not 
exist an exact and stable solution of the coupled field equations, as can be shown as 
follows. Suppose there was a solution with p ^ 0, and a finite but nonvanishing action 
for the scalars. If one replaces A^(x) by aA^ax) and (p(x) by ip(ax) (which preserves the 
boundary condition \<p\ — > v) then the action becomes upon also setting ax = y 



S d (a) = J d 4 y 



-^KM + l 2 \DMy) I 2 



i 



2\2 



+ ^\(p>*(y)<p(y) - v 1 ) 
a 4 



(9.11) 



Note that all three terms in the action are positive. Replacing A^(x) by aA^ax) for a near 
unity amounts to a particular small variation of A M , and similarly for tp. So one can make 
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the value of the action slightly smaller by making a slightly larger then unity. This proves 
that no solution exists. In fact, if a tends to infinity, we approach the bound S = 8n 2 /g 2 , 
but this bound can never be reached. The expression for aA^ax) is equal to the instanton 
solution with p replaced by p/a, and for a — > oo we get a zero-size instanton. That leaves 
open the possibility that a local minimum might stil exist, but detailed analysis shows that 
this is not the case. This scaling argument is called Derrick's theorem [74], and often yields 
valuable information without having to perform integrals. 

One can still use an approximate solution to find a large part of the contributions to the 
path integral, and this approximate solution is obtained by first inserting a constraint into 
the path integral which yields an exact solution, and then to integrate over this constraint. 
The idea is as follows. There are one or at most a finite number of directions in field 
space along which the action decreases ("destabilizing directions", in our SU(2) model the 
directions parametrized by a). Deformations in all other directions increase the action. The 
constraint prevents deformations in the destabilizing directions, and on first minimizes the 
action with the constraint present. The solution is called the constrained instanton. It looks 
like the instanton for pure Yang-Mills theory at short distances but decays exponentially 
at large distances. It has a particular value of p. Finally, one integrates with the measure 
for the zero modes over all values of p. The expectation is that this should capture most 
of the path integral, even though one is not expanding around a solution of the theory 
without constraint. For the SU(2) instanton one may add a term o\ J d 4 x[trF 3 — c±p~ 2 ] 
to the action to constrain deformations in the direction of the gauge zero mode (d/dp)A°^, 
and a term a 2 [J d 4 x((p*Lp — v 2 ) 3 — C2P~ 2 } to freeze deformations in the directions of the 
matter zero mode (d/dp)ip cl , with ip d given by (9.7). One might fix the values of c\ and c 2 
such that the constraint is satisfied for the instanton solution and cp in (9.7). The Lagrange 
multipliers U\ and <7 2 are then fixed order by order in perturbation theory, by requiring 
suitable boundary conditions for the deformations. 

The result is that one can make an expansion of the full approximate solution in terms 
of pv and finds then the following results in the singular gauge [72, 73, 32]: 

(i) inside a core of radius p = where mw = gv, the approximate solution given in (9.7) 
is still valid 

(ii) far away the solution decays exponentially, A^ ~ exp(— raw|x|) and \(p—v\ ~ exp(— m H \x\) 
with ttih = 2\f\v. 



(iii) the integral over \D^\ 2 has the same leading term 2n 2 p 2 v 2 + Ol X(vp) A ln(vp 



Hence, the Higgs effect indeed solves the large p problem, and asymptotic freedom solves 
the small p problem. Constrained instantons are also relevant for Af = 1,2 SYM theories. 
They can also be studied in the context of topological YM theories, as was discussed e.g. 




but the potential term is now convergent and yields a result 0(X(vp) 4 \n(vp\/X) 
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in [75]. 



10 Instantons as most probable tunnelling paths 

Instantons of nonabelian gauge theories can be interpreted as amplitudes for tunnelling 
between vacua in Minkowski space with different winding numbers Q. We shall deter- 
mine a path in Minkowski spacetime which yields the "most probable barrier tunnelling 
amplitude". We follow closely [76], but related work is found in [77, 78]. 

We begin with one particular path Aj^ = {Aj(x, t), A Ij0 (x, £)} from which we construct 
a class of paths which all differ by how fast one goes from one configuration at ti to the 
next at ti- Namely, we make a coordinate transformation from t to X(t) in Minkowski 
spacetime and consider the following collection of paths 

A\ X \x,t) = Mx,\(t)) ; A™(x,t) = A Ifi (x,X(t))X(t) (10.1) 

(Often one works in the temporal gauge A^ = because this makes the physical inter- 
pretation clearer. All our results are, however, gauge invariant). The case X(t) = t yields 
the original path, but different X(t) yield paths which all run through the same sequence 
of 3-geometries Aj(x, ti), Aj(x, £2), Aj(x, £3) . . . but at different speeds. The variable X(t) 
can be considered as a kind of collective coordinate which measures a kind of continuous 
winding number because we will start with one winding number and end up with another 
winding number. For £ between t\ and £2 this continuous winding number is due to an 
integral J d 3 x f* dt'd^j^ over a surface where A^ is not everywhere pure gauge. Only for 
£ = ti and t = £2 does A^ everywhere on the surface become pure gauge and only at these 
times the winding number is an integer. These initial and final configurations describe 
vacua of the theory in Minkowski spacetime. We can also consider another particular path 
An^ = {Au(x, £), Anfi(x, £)}, and then we can in the same way create a second class 
of paths, parametrized again by the function X(t). In this way we generate an infinite 
collection of classes of paths. 

For a given class A^\x,t), we can substitute A^ and A^ into the action, and then 
we obtain, as we shall show, the Lagrangian for a point particle (one dynamical degree of 
freedom) 

L=\m(X)X 2 -V{X) (10.2) 

where m(X) and V(X) depend on the choice for A^. We shall then determine for which 
m(A) and V(X) the tunnelling rate is maximal. The solution of this problem in Minkowski 
space involves instantons in Euclidean space. A crucial role is played by the notion of a 
winding number in Minkowski space, so we first discuss this subject. 
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One can define a winding number Q in Minkowski space in the same way as in Euclidean 
space because Q does not depend on the metric (in technical terms it is an affine quantity) 



^ 64tt 2 J ^ pa 

^ J(tiF, u e^F pa )d A x 



32^ 
-1 

471-2 



trE-Bd A x (10.3) 



where d A x = d 3 xdt and e 0123 = +1, and we used that = A®T a with T a = — \o a so that 
tr(T a T b ) = —\5 a b and the structure constants are given by [T a ,T b ] = e ab c T C) so f ab c = e ab c . 
Furthermore, by definition Ej = —F j and Bj = \ejkiFki- Because we are (and stay all the 
time) in Minkowski space, e 0123 = -e i 23 = +1 and -F^F^ = 2F 2 - Ff-. The integral is 
taken between two 3-dimensional hypersurfaces U\ and a 2 at t\ and t 2 . 

If at ti the configuration A^x^t) describes a vacuum, it has by definition vanishing 
energy. Since the energy density 40 is given by Ti, = \(E a ) 2 + ^(B a ) 2 , vanishing energy 
means F* v = 0, hence A^ is pure gauge at t — t\ 

A„(x, h) = e -«(^i)^e Q(5 ' tl) . (10.4) 

Similarly, at t 2 we have A /Ji (x,t 2 ) = e~^ x,t ^d,j,^ x ' t2 \ We now choose the temporal gauge 

Ao(x,t) = 0. (10.5) 

Having fixed A Q = 0, there are still residual space-dependent gauge transformations possible 
because they preserve the gauge A = 0. To check this statement is easy: 

A' (x , t) = e-^doe 9 ^ = . (10.6) 

We use these residual gauge transformations to set a(x, ti) = 0. 41 Then A^(x,ti) = for 
all fi and all f. 



40 The gravitational stress tensor is = F^ p F^ - \ri^F^F a 'P a and T 00 = \{E a f + \(B a f. 
One can also obtain T M „ from canonical methods as follows. Evaluating H = pq — L with q = Aj 
and p = —Ej one finds upon using that Aj — Fqj + DjAq and partially integrating that H = 
I W {(Ej ) 2 + (Bj ) 2 } + Aq(D j Ej)j d 3 x plus a boundary term. For solutions of the field equations such 
as the vacuum, D 3 Ej = 0. For configurations with finite energy (E = O^i) the boundary term vanishes 
when A falls off like O(^). Moreover in the temporal gauge the last term vanishes. Actually, according 
to the Dirac formalism, the Gausz operator D 3 Ej is a first-class constraint, and should be omitted from 
the Hamiltonian. Thus. H — J [^(E") 2 + ^(£>") 2 ] d 3 a; also according to canonical methods. 

41 With Aj(x,h) = c -^(fM)Q je »(xM) we get A'j = e^^e-"^* 1 )^"^* 1 ^ 9 ^) and clearly Aj = if 
we take e 9 ^-* to be the inverse of e™' 1 '' 1 '. 
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Note that even if there is winding in the vacuum at t — t\ (such winding at one fixed 
time is discussed below (10.13)), one can still gauge it away by a time-independent gauge 
transformation, but then the winding at t = t 2 increases by just the same amount. This is 
as it should be, because the total winding is gauge-invariant. 

We shall consider paths from <Ji to 02 which at every time t have finite energy (finite 
integral J (E 2 + B 2 )d 3 x). This means that the energy density for fixed t must tend to zero 
for |x| — > 00 (to make the integral J (E 2 + B 2 )d 3 x convergent), hence at large |x| the gauge 
fields become pure gauge 

\x\— >oo 

But since A (x, t) = 0, we see that a(x, t) is independent of t. Because a(x, ti) = we 
obtain a(x,t) = for all t and \x\ — > 00. This means in particular that at t 2 for large \x\ 
the gauge fields tend to zero 

Aj(x,t 2 )- ^0 (10.8) 

\x\^oo 

The fact that for large \x\ all Aj vanish allows us to compactify the 3-dimensional 
spacelike hypersurfaces at fixed t into spheres S3. The north pole of each sphere corresponds 
to all points with \x\ = 00, and at this point on 63 all Aj vanish. Thus, all 3-spaces at 
fixed t compactify to a sphere S3. We summarize the results in a figure 




(10.9) 

Everywhere on the boundary of this cylinder the gauge fields vanish, except at the disk at 
t = t 2 , but there A = and Aj are only pure gauge. 

We now return to Q. First of all, Q can be written as a total derivative, using the same 
algebra as in Euclidean space 

Q = ^ vpa [ d,tr \A v d p A a + \A v A p A a \^x . (10.10) 
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(we recall that tr [A ll A u A p A IJ e' lupa ] = 0). Furthermore, since on the boundary F pv = 0, we 
can replace d p A a by —A p A a in (10.10). We then find 

Q = J dor M tr [AuApAr] A v = e- a d u e a . (10.11) 

Since Aq = 0, there is no contribution from the sides of the cylinder, and since Aj = at 
the bottom, there is also no contribution from the bottom. Hence in the gauge we have 
chosen, all contributions to the winding come from the top of the cylinder: 

Q = ^^^ ijk J Tr(e- a d i e a )(e- a d j e a )(e- a d k e a )d :i x . (10.12) 

At the top of the cylinder the 3-space t = compactifies to a sphere 5*3 (space). The 
map from this 3-sphere into the group SU(2) is a map from one S3 to another S3 42 because 
(i) we can always compactify the R 3 with coordinates x to an S3 and (ii) the gauge fields 
at \x\ — 00 are equal (and vanish) 



S 3 (space) 




m 3 at t=t 2 



(10.13) 

The maps S3 (space) — > S3 (group) in Minkowski space fall into equivalence classes with 
a winding number fceZ, just as the maps of instantons in Euclidean space give maps from 
S3 (space) — > S3 (group). In the latter case S3 (space) is the boundary of all of R 4 while 
here it is the compactification of the whole R 3 at t — ti- It follows that 

Q = ±k k e Z . (10.14) 

42 Thc matrix elements of any 2x2 complex matrix can be written as g — a p a^ with = {a, 1} and 
/i = 1,2,3,0. Unitarity requires that = a*cr^ equals g~ l , hence g^g — \a p \ 2 + (cijCikicjki + ®o a i + 
a^ao)(j l = 1. Hence |ao| 2 + \a,k\ 2 — 1 and the coemcients of 07 must vanish. The determinant yields 
det g = a§ — a 2 ., and since also |ao| 2 + \ak\ 2 — 1, requiring det g = 1 leads to = ±i|a/c| and ag = ±|ao|. 
Then we are left with g = a$I + ia^ak with real oq and afe satisfying a 2 , + a 2 = 1 which defines S3. 
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We now can draw a picture of the energy H = J 7id 3 x at times t as we move from t = t% 
to t = t%- Initially and at the end one has H = 0, but in between we must have H > 
(note that 7i > 0) for the following reason. 




t 



(10.15) 

There are no paths possible which connect the vacuum at t\ to the vacuum at ti which 
are solutions of the field equations because if F^ u = on o\ (or er 2 ) and the field equations 
are satisfied, one has F^ v = everywhere 43 . But, if F^ u would vanish everywhere, Q ~ 
J E ■ Bd 4 x would vanish, hence one could not change the winding number. The conclusion 
is that paths which go from one vacuum with winding number zero to another vacuum 
with nonvanishing winding number necessarily have positive energy at some intermediate 
times. 

We are now ready to define a subset of paths which depend on one collective coordinate, 
and to which (we claim) we can restrict our attention. Consider first one given path 
corresponding to a fixed field configuration Aj(x, t). Instead of this single path, we consider 
the set of paths Aj\x, t), as defined in (10.1). Each path is labeled by a different function 
A(t), and is defined by 

Af\x,t) =Aj(x,\(t)) . (10.16) 

As we already discussed, for A(t) = t we recover the original path, but for different A(i) we 
obtain paths which run through the same 3-dimensional configurations A(x, ti), A(x, £2), A(x, t 3 ) . . . 

43 For the proof, note that if at t\ one has F^ v = and at all t one has D^F^ — 0, then d^F^j = 
doFoj = at t\. Furthermore, the Bianchi identity D^Fij + DiFjo + DjFoi = yields d$Fij — 0. Hence 
doF^u — at t\. Also djF^ u = because = at t = t\ for all x. Hence dpF^ = at t = t\ 
for all p,iJ.,v- We can rewrite this as (DpF^) = at t = t\. Next we repeat this analysis by noting 
that also D^DpF^) = at t = t u because Dp(D p F^) = [D^D^F^ + D p (D^F» u ) and = 

everywhere. This shows that do(D p F 0: >) = at t = t±. To also show that do(D p Fij) = at t = t\ we 
rewrite d (D p F i: j) = -d Q DiF jp - d D 3 F pl and then use Z? (A^) = [-Do, + D^DqF^) = 0. In 

this way we get OqF pl , = for any n. Hence F^ v — at all t. 
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at different speeds. For example if X(t) is constant for some time interval, the correspond- 

— * 

ing A(x,t) do not change, but if A(t) changes rapidly, the sequence of A(x, t) is traversed 
rapidly. 

Each path Aj(x, X(t)) should begin at Aj(x,ti) and end at Aj(x, t 2 ), so we require 
A(ti) = t, and X(t 2 ) = h, but between t\ and t 2 the function X(t) is arbitrary. We shall 
later take t\ = — oo and t 2 = +00, and then also require that X(ti) = —00 and X(t 2 ) = +00. 
Given a path A^p (x, t) we can compute the electric and magnetic fields 

dA 

-Ej = F 0j = d Af\x,t) = -^-(x, X(t))X because A (x,t) = 

oX 

Bi = ^e ijk F jk = ^eijkidjA^x, X(t)) + Aj(x, X(t))A k (x, X(t)) - j ^ k) . 

(10.17) 

The Lagrangian L = j Cd 3 x with C = ^tr F* v = ^tr (E 2 — B 2 ) can then be written 



as 



L = ^m(A)A 2 




m(A) = 

V(X) = / trfi 2 d 3 x > (10.18) 



The momentum conjugate to X(t) is p(X) = -^L = m(X)X. Hence 



For a given path A*p (x, t) one can plot H as a function of t, and one finds then the profile 
in figure (10.15). 

We have thus isolated a class of paths A^(x,t) which depends on one collective coor- 
dinate X(t). For one given A(x,t), this still yields an infinite set of paths, but all these 
paths run through the same set of 3-configurations Aj(x,ti), Aj(x,t 2 ), . . .. These are, of 
course, infinitely many other collective coordinates which describe a general path Aj(x,t), 
but the idea is that X(t) is the relevant coordinate to describe tunnelling, while the other 
collective coordinates describe variations away from the paths A^ (x, t) which give only 
small corrections to the results obtained from X(t). It is, of course, difficult to prove this 
assertion; one could begin with two collective coordinates as a start, but even this would 
lead to a complicated analysis. 

The action for X(t) in (10.18) can be viewed as the action for one point particle. This 
particle feels the potential barrier V(A), and to go from the vacuum at t — t\ with V(A) = 
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m(A) = to the vacuum at t 2 with also V(A) = m(A) = 0, we need tunnelling. The 
tunnelling rate R in quantum mechanics is proportional to e~ 2R where 

R = J d\y/2m(X)(V(X) - E) , (10.20) 

Ai 

with A(*i) = Ai = fi,A(* 2 ) = A 2 = t 2 and V(A(ti)) = V(X(t 2 )) = and m(A(*i)) = 
m(X(t 2 )) = 0. We also set E = because we consider tunnelling from one vacuum (with 
E — 0) to another. 

There is, of course, an important difference with ordinary quantum mechanics. The 
point particle X(t) feels a potential V(A), but both are derived from the same object, the 
fields Aj(x, X(t)). In addition the mass is here "position" -dependent, m = m(X). One can 
show that in quantum mechanics the formula for R also holds if the mass m(X) depends 
on the point particle A(t)). The crucial step is now to pose the question: for which set of 
paths A(x, X(t)) is the tunnelling rate maximal? The tunnelling rate for the quantum 
mechanical particle X(t) can be described by Minkowski path integrals, so we ask: for which 
A(x,t) is there least destructive interference of the associated paths A(x, X(t)) in the path 
integral? Clearly, V(A) should be as small as possible, but it cannot be too small because 
it must produce winding. 

The tunnelling rate is e~ 2R where according to (10.20) 



A 2 

R= I dA 2 



Ai 




d s x 




tr B 2 d 3 x 



1/2 



9 Z 



t-2 

I dt[(tr j E 2 tfx)(tY J M 2 d 3 x)]V 2 . 



(10.21) 



We replaced dX by dtX and brought A inside the square root. The fields E and B still 
depend on X(t). Since tr J a(x)b(x)d 3 x is an inner product, while j tr E ■ B is proportional 
to the winding number according to (10.3), we have the triangle inequality 

*2 



R > 



9 Z 



(trE-B)d 4 x 



9 



Q 



tl 



Hence the tunnelling amplitude is bounded from above by 



C - R < -e 



.8^ 

9 



\Q\ 



10.22) 



(10.23) 



The inequality is saturated when E is parallel to B at each vector x and at each time 
t : E(x,t) = a(t)B(x,t). The claim is that among all paths with the same Q, the paths 
with the smallest R are the paths with E parallel to B. 
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Let us discuss the meaning of this result. Paths which interpolate between vacua with 

— * — * 

different winding number must produce electric and magnetic fields E and B in between at 
finite x and t which cannot be too small, namely | J (E^B^d^x] should be equal to 87r 2 |Q|. 
On the other hand, the tunnelling rate is proportional to the length of E a times the length 
of B a , so to make the tunnelling rate as large as possible, the product of these lengths 
should be as small as possible. One could set up a variational problem for R under the 
constraint that J tr E ■ B d 4 x be equal to 4ir 2 Q, but we shall not work this out. 

The bound is reached, namely the tunnelling rate is maximal, when the set of paths 
Aj(x, X(t)) produces parallel electric and magnetic fields 

E(x, X(t)) = a(t)B(x, A(f)) • (10.24) 

Of course, a(t) can also be viewed as a function of X(t) because A(t) is just another 
parametrization of the time interval. Note that this condition does not change if one 
changes the parametrization from X(t) to another function X'(t), because under such 

— * 

reparametrizations E scales by a constant factor d\'/d\, which cancels the Jacobian in 
(10.20) for this change of integration variables. We use this scaling property to select a 
particular X (t) such that E(x, A (t)) = ±B(x, A (t)). The property of E and B being 
parallel is also a gauge- invariant property, and C and R are of course gauge-invariant. So, 
our characterization of paths with maximal tunnelling rate is gauge-invariant, as it should 
be. Thus the use of temporal gauge did not restrict the generality of the results. 

We now can establish the connection between tunnelling and instantons. The fields 
for which E and B in Minkowski space are parallel are closely connected to instantons 
in Euclidean space. Namely, among the class of paths A(x, X(t)) parametrized by X(t), 
there is the path E(x,X (t)) = B(x,X (t)) (and another path with another A' (t) such 
that E(x, X (t)) = —B(x, A (t))). If we then define Euclidean gauge fields A^(x,t) by 
Af(x,t) = Aj(x,X (t)) and Af(x,t) = A (x,X(t))^ then this A^(x,t) is self dual. The 
parameter t is Minkowski time, but in the expressions for Af(x, t) we should interpret t as 
the Euclidean time. 

Summarizing: the most probable tunnelling paths are given by the set of paths Aj(x, X(t)) 

— * — * — * — * 

with parallel E and B fields. A given class of paths with E parallel to B contains one path 
which, when viewed as a configuration in Euclidean space, is an instanton. Conversely, 
given an instanton A^(x,t) in Euclidean space, one can construct a corresponding set of 
paths A*f(x, X(t)) in Minkowski space by setting 

A^ x \x,t) = Af(x,X(t)) 

A ( M ' {x \x,t) = Af(x,X(t))X. (10.25) 
As an example we take the Q — — 1 anti-instanton solution in regular gauge, A^ = 



73 



-a pv x v I \x 2 + p 2 ), see (3.41), which yields the following set of paths in Minkowski space 

A Q (X,t) = g2 +X \^2 +p2 Kt) 

A^ x \x t) — a ( f )° ! ~" x ° ! 



\{t — > — oo) = — oo 
\{t — > +00) = +00 . 



10.26) 



We are clearly not in the temporal gauge, but since our results are gauge-invariant, it does 
not matter which gauge we use. We still have A p — > at large \x\, so that we still have 
the notion of winding as a map from S3 (space) into S3 (group) at each time. 
Straightforward calculation yields for the curvatures in Minkowski space 

lip 2 o\ 



F01 = d Q A x - diA Q + [A , Ax] = 
F23 = d 2 A 3 - d 3 A 2 + [A 2 , A 3 ] = 



Hence 



E 



-1ip 2 a 



(x 2 + A 2 + p 2 ) 



,-A; 



B 



(x 2 + A 2 + p 2 ) 2 
2ip 2 a 1 
~~ (f 2 + A 2 + p 2 ) 2 

2ip 2 a 
{x 2 + X 2 + p 2 ) 2 ' 



A, 



;i0.27) 



;i0.28) 



which depend on x 2 = x 2 + A(t) 2 (not on x 2 — t 2 ). Hence, E is indeed parallel to B (in 
fact, anti-parallel). 

The winding number Q can be written in two ways 



Q 



-1 
-1 

24T 2 ' 



[trE -Bd 3 x]dt 



c tiupcr 



d^tr [A v A p A a ]d A x 



;i0.29) 



In the latter expression Q receives only a contribution from the boundary, 44 but in the 
former expression we compute Q by integrating over all space and time. It is then natural 
to define a t-dependent function by integrating only up to a time t 



-1 
4^ 



1 



4tt 2 



trE ■ Bd^x 



dX / d x- 



24p 4 



x 2 + A 2 + p 



214 



p 4 dA 



(A 2 + p 2 ) 5 / 2 



44t 



which is 



4 For example, the contribution to Q from the surface at t — t\ is proportional to J ^ t2+ g2 +p 2^3 
nonvanishing. On the other hand, the contribution to Q from the sides of the cylinder converges for large 
\tl 
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X/p 



dy 



(y 2 + I) 5 / 2 



with x 



A 



10.30) 



Clearly, q(t) is gauge- invariant and has the following form 




(10.31) 



It only receives contributions from regions where E and B are nonvanishing, hence where 
is not pure gauge. 

To obtain the action for A(t) in this example we evaluate 



C = --M{E 2 -B 2 ) = -n* 

g 2 r 

Doing the space integral we obtain 



A 2 



1 



(x 2 + p 2 ) 4 (x 2 + p 2 ) 4 



L = ^m(A)A 2 - V(X) = g2(A f^ 2)5/2 (A' - 1) 



where we used 



/ 



d 3 



x 



4lT 



(x 2 + A 2 + p 2 ) 4 (A 2 + p 2 ) 5 / 2 2 J (y 2 + l) 4 32 (A 2 + p 2 ) 5 / 2 



y 2 dy 47r 2 



(10.32) 



(10.33) 



(10.34) 



In this example, we were dealing with a gauge with Aq ^ 0. We can map to a gauge in 
which Aq = by a suitable large gauge transformation 



(7 = exp 



arctg 



A(t) 



a/o 72 + p 2 a/x 2 + p 2 
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(10.35) 



Indeed, using the expression for A in (10.26) 

— IX ' (J 

A <> = ffTxWT? Mt) • (10 ' 36) 

one finds that A' = U~ l (d t + A )U = U' x d t U + A vanishes 

= — i + A = o (10 37) 

where we used that A commutes with U . Of course, Q is gauge invariant because it can 

— * — * 

be written as a trace over E ■ B but it is instructive to see what happens if one writes 
Q as a surface integral and makes a gauge transformation with U . On the boundary of 
Minkowski space the A^ = V^d^V transform into (VUY l d^VU and the winding number 
of VU is the sum of the winding numbers of V and U. However, U is connected to the 
identity element: U = expa[^^= arctg ] traces an orbit as a runs from to 1 

which begins at the identity element and ends at U . Thus U does not produce any winding, 
and thus the answer for Q from the total derivative is the same, whether one uses a gauge 
in which A vanishes or a gauge in which A is nonvanishing. Note, however, that when 
A 7^ one gets contributions from the timelike part of the boundary of the spacetime 
cylinder. 



11 False vacua and phase transitions 

In spontaneously broken gauge theories, the potential has a local maximum and an absolute 
minimum. These extrema form a metastable and a stable vacuum, respectively. If a system 
is in the metastable vacuum at all points in spacetime, it could at some point and at some 
time, say x = and t — 0, make a quantum fluctuation to the stable vacuum. This 
transition costs energy, but if the region around x ("the bubble") is large enough, the 
energy needed for creation of a bubble (this energy is located in the boundaries of the 
bubble) is less than the energy gained by tunnelling to the lower vacuum (this energy is 
liberated in the volume of the bubble), and then the bubble will rapidly expand. In fact, 
since the rate of energy production increases the larger the bubble, the bubble will spread 
through space, with accelerating speed, converting the false vacuum to a true vacuum. As 
an application of this process one may consider the universe just after the Big Bang; at 
high temperature the universe is in the symmetric vacuum, but as cooling due to expansion 
sets in the potential develops a lower (true) vacuum, and if for some reason the universe 
remains stuck in the false vacuum, one can study the decay of the universe towards the 
true (asymmetric) vacuum. We shall consider another example: the perturbed double-well 
potential, with two classically stable minima, but one minimum (the true vacuum) below 
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the other minimum (the false vacuum). We shall study the decay of the false vacuum in 
this model into the true vacuum [79, 80]. We follow [81]. 

As a preliminary to the calculation of the phase transition in field theory, we first revert 
to quantum mechanics and study the double-well. Let us pretend that we do not know 
that there are big differences between the double-well potential and the following potential. 



We can then repeat the calculation of the nonperturbative corrections to the energy of 
the ground state. Already at this point it is clear that we should not blindly repeat all 
steps because previously we were dealing with two perturbatively degenerate vacua, and 
the kink-instantons provided the energy shift between both vacua. In the present case, 
the degeneracy is already broken at the classical level. Proceeding nevertheless we find a 
classical solution of the Euclidean equation — + = describing a point particle x(t) 
in the inverted potential and use path integral methods. 



V{<p) 




(11.1) 




x 



(11.2) 
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The particle starts at t = — oo in the point x — a, rolls to the point x — c, "bounces" at 
time t — X, and ends up at t = +00 at the same point x — a. Clearly, X is the collective 
coordinate for this classical solution x d (t). We then get for the "one-bounce solution" 

Too = <x = a\ e~^ Hro \ x = a >= e^ l T 0V ^S^I , 
J = N J dg(r)es^ 2) with q(±r 0/2 ) = , (11.3) 

n.z. 

where we used the Faddeev-Popov trick, and "n.z." indicates that the path integral is over 
the solutions of the field equation for the fluctuations about x c i(t) in the space orthogonal 
to the almost-zero mode. Assuming again that Jo can be written as a factor K times the 
path integral of the harmonic oscillator we get 



T - K Pp-!-o . K det(-^ + ^) 

Continuing without further thought we would sum over multi-bounces and obtain 

v n=0 

= ^-e-^ T0 exp(KT e^ Scl ) . (11.5) 
Tin 

Using the same arguments as used before for the unperturbed double-well potential, we 
would conclude that the ground state energy is given by 

E = ^hu-hKe^ Scl . (11.6) 

However, at this point we note that there are problems with this result 

(i) first a small problem: the nonperturbative correction is exponentially suppressed, hence 
it should be neglected compared to the perturbative correction. 

(ii) a more serious problem (actually a virture, as we shall see) is that K has a negative 
eigenvalue. This is easy to prove: ^x c i(t — X) is the zero mode fluctuation. It has a 
mode because x c i bounces: unlike the kink, x c i(t) moves first forward and then backwards, 
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yielding a kind of kink-antikink solution. 




(11.7) 

Hence there exists one mode for the fluctuations with lower eigenvalue and without a node, 
and since ^x c /(x — X) has zero eigenvalue, there exists an eigenfunction for the fluctuation 
with negative eigenvalue. Thus the nonperturbative correction is imaginary, reflecting the 
fact that the perturbative ground state near x = is nonperturbatively unstable 

ImE = h | K | e^ Scl = T/2 . (11.8) 

So, instantons (or rather bounces, still solutions of the classical field equations with finite 
action) yield in this case the width T of the unstable state. 

Having seen that in quantum mechanics the path integral approach to nonperturba- 
tive corrections to the vacuum energy leads to the correct result that the ground state is 
unstable, we now return to the problem of phase transitions. 

As a toy model for studying such decays we need a system with at least one space 
coordinate because bubbles have a finite extension in space. The simplest choice is a 1 + 1 
dimensional field theory. We choose the double-well potential with an extra term to destroy 
the degenacy between both minima. Since the double-well potential is symmetric under 
(p — > —(p, the extra term should be antisymmetric, and if it is to be a small perturbation 
compared to the leading \(p A term, we need either a term linear in (p or cubic in <p, or both. 
It simplifies the mathematics if we keep the local minima of the perturbed potential at the 
same place as the minima of the unperturbated potential, namely at <p = ±/x/ vA. We are 
then led to the following model 
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where we take B small and positive. For constant tp, the solutions of the classical field 
equations occur at 

£) + (11 ,o, 

and from this result it is clear that the values ip = ±fi/y/\ are indeed extrema. The 
potential has the following form 




ni.ii^ 



It vanishes at <p — -/i/vA because we added the constant |i?(/i/-\/A) 3 , but at (p = fi/y/X 
it is negative. Thus <p = —fi/y/X is the unstable vacuum and (p = fi/y/X is the stable 
vacuum. The value of the potential at the stable minimum is 



V(<p = n/Vx) = -6 = ~B (ji/Vxf . (11.12) 



There is a relative maximum a bit below the maximum of the symmetric potential V(B = 
0, p) at p = 0; for small B it occurs at tp ~ —B/X and its value is |/i 4 /A — \e + 0(B 2 ). 
These results are intuitively clear: if one pulls p down at /i/vA by an amount e, then the 
maximum at (p = is pulled down half as much, and moves of course a bit to the left. 

In addition to the three solutions of the classical field equations with constant ip(ip = 
—fi/y/X, (p = fi/y/X, and (p ~ —B/X), there is an exact kink-antikink solution. This is clear 
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by inspection of the inverted potential 




<p 



(11.13) 

A ball at rest at <p = —fi/\^X at x = — oo starts rolling down to the hill and up the other 
hill; it reaches the point where V(ip) = at x = and then returns and comes to rest at 
(p = —fi/y/X at x — +oo. The classical solution ipd( x ) is thus a soliton of the following 
form 



<Pcl O) 




;n.i4) 



We approximate <fci( x ) by the following expression 



tanh 



m 



(x + X c 



i ( m i 
tanh — [x 

V 2 v 



;n.i5) 



This is a static soliton in 1 + 1 dimensions, which can also be viewed as an instanton in 
x-space. (In the quantum mechanical models we considered previously, we dealth with 
instantons in Euclidean time). Near x = —X c the antikink is exponentially suppressed 
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and the mass of the kink is M. Between the kink and antikink <p is equal to Li/y/X~ (up 
to exponentially suppressed corrections), and near x = X c we have an antikink with mass 
M. For large x we find the correct asymptotic value (pa(x — > ±00) = —fx/y/X. We fix the 
value of X c such that the total energy of (p c i(x) (which is the energy of the ball rolling up 
and down the hills in (11.13)) vanishes 

E = 2M -2eX c = , (11.16) 

3 

where M — ^ is the classical mass of a single kink. Hence, the separation between the 
kink and antikink is given by 2X with X = M/e. 

The exact solution begins at V — 0, climbs the hill, and comes down on the other side 
where it reaches the value V = 0, and then it returns, climbing the hill once more, and 
ending at V = 0. The approximate solution comes down to V = — e after climbing the hill, 
but it has more energy in the kink (and antikink) region, such that in both cases the total 
energy is zero. 

We now compute the transition amplitude from the unstable vacuum ip = —fx/y/X to 
the kink-antikink solution (the bubble). Once a bubble has formed, it will rapidly grow 
(the kink and antikink move increasingly fast away from each other, i.e., X exponentially 
increase). 

This is a tunnelling process because classically it is forbidden but quantum mechanically 
allowed. If the field <p at x — starts making a transition from the metastable vacuum 
to the stable vacuum, it must first climb the potential barrier, but when it comes down 
in the true vacuum energy density — e is gained. However, as we already mentioned, it 
takes energy to distort the field in order to go from one vacuum to another; this is just the 
energy (mass) of a kink and of an antikink. These energies are located at the boundary of 
the bubble (around the centers of the kink and the antikink). Once in a while there occurs 
a quantum mechanical transition to a bubble which is large enough that e2X is larger than 
2M; in that case the bubble does not collapse but grows increasingly rapidly. 

Note that we do not tunnel from the state (p(x) = —fx/y/X to the state <p(x) = fx/y/X 
because the energy difference of these states is infinite (namely e times the volume of x- 
space, so 2Le with L — > 00). When we discussed the unperturbed kink, the vacua tp = 
±/j,/\/x were exactly degenerate, and in such cases the true vacuum is a linear combination 
of these vacua which can be determined by tunnelling from one vacuum to another. 

The intermediate configuration with the kink and antikink moving away from each 
other can be described by Lorentz boosting the kink to a velocity — v and the antikink to 
a velocity +v 



<Pd(x,t) = 



. m ( x + X c + vt\ m ( x — X — vt 

tanh — ; — tanh — — , — 1 



2 V J 2 V yfl 



v 



2 



(11.17) 
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For constant X the boost of the kink is again a solution because the field equation use 
relativistically invariant. However, since X itself is expected to change with time, we 
denote X + Xt by X(t) and obtain then 



(Pd(x,t) = 



Vx 



tanh 



m f x + X(t) 

2" WT^T 2 



tanh 



m 



x 



X(t) 



a/i-a 2 



(11.18) 



The distance between the kink and antikink is now 2A(t). The Lagrangian for this 
approximate solution is obtained by substituting ip d into the action. The calculation of 
the first two terms is straightforward. Taking twice the result for a single kink yields 



/i 2 m 2 



dip 
dx 



dx = 2 / dx 



1 



A 



+ 



' cosh 



\ 2 

XX(x + X) \ fi 2 m 2 



m x+\ 



2 y/l^ 
1 



A 4 (1 - A 2 ) 3 / 2 y A 4 i_A 2 



(11.19) 



The calculation of the contribution from the nonderivative terms splits into two parts: 
from the region between the kink and antikink we obtain a term e2A, while from each of 
the two walls we find a term \M\/ 1 — A 2 as we now explain. Around x = —A and x = +A, 



the integral J V(ip)dx with tp = -*j= tanh 



m x+X 



can be evaluated as follows. The integral 



f°° W 2 (<p)dx with <p = -£=tanh^(x* + A*) with x* = -£= and A* = is equal 

to (J \U 2 (Lp(x*))dx*)\/l — A 2 . From equipartion of energy for a static kink we know that 
the integral / \U 2 (^(y))dy equals \M. Thus / {\XJ 2 ){y d )dx = |M\A - A 2 . 



around kink 



Hence, neglecting term with A, we find 



L = - 



m 



oo 

Vl-A 2 / 



dy 



8A \ m " ./ cosh 4 ?/ 

30 

= —2M\/ 1 - A 2 + e2A . 



+ e2A - mV 1 - A 2 



;n.2o) 



The Hamiltonian follows from p 



8L _ 2MX 
9 * 



and reads 



H 



2M 



e2A = vV + AM 2 - e2A . 



\/l - A 2 

We can split if into a kinetic term A' and a potential term 1/ 

A = vV + 4M 2 -2M= -p 2 /M + C(p 4 ) , 

2 



(H-21) 
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V(X) = 2M-e2\. (11.22) 

This formula for V^(A) is valid when the bubble is reasonably large: when A is larger than 
the kink size (when the bubble is larger than the thickness if its walls). For smaller x we 
expect that V(A) rises from till a maximum value when the bubble is formed, and then 
decreases as the bubble gets larger 




;il.23) 



The value X c corresponds to the classical solution, with energy E = and constant X, 
corresponding to the ball rolling in the inverted potential. For this case, p = 0. Quantum 
fluctuations with X < X c produce only bubbles which collapse since their potential energy 
is positive, but bubbles with X = X c are metastable (they have constant X = X c so p = 0), 
while for X > X c the bubble expands. 

We now treat H as the Hamiltonian of a point particle which sees the potential V(X) 
and has energy zero. We find with the WKB approximation for the tunneling amplitude 



A = exp 



exp 



p | dA 



^4M 2 - (e2A) 2 dA 



;il.24) 



where we used that H = = a/p 2 + 4M 2 — e2A. Since X c = — , we have 
A = exp 



i 



M 2 \M 



o 



exp 



2M 2 



1 — y 2 dy 



exp( 



^7 



;n.25) 
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Hence, the rate of the transition to the true vacuum is exp — ZL ^- per second and per unit 
volume. (To evaluate the integral we set y = cosy?). 

We end this section with a few comments. 1. The decay of the false vacuum per unit 
time and per unit volume is of the form T/V = Ae~ B ^ n (l + 0(h)). We computed B. For 
A see [80, 76]. 

2. We used energy conservation to determine how fast a bubble expands. However, we 
neglected radiation of mesons. In general, when the false vacuum collapses to the true 
vacuum, mesons will be created, and thus the bubble will expand less rapidly. 

3. Above we considered the critical bubble: a static solution of the classical field equations 
which describes a bubble which has just the correct form and size that it is metastable. 
For larger sizes there is no static solution, but one can consider the creation at t = of a 
large bubble which then expands. This is an initial value problem: tp(x) is given and also 
|^ = at t = 0. One can define the size of a bubble for example as Q = /^(y? + ~^%) 2 & x - 
Far away, y? = — -^=, so Q is finite for bubbles. A problem we now want to solve is: given 
the size Q of a bubble, for which shape is its action minimal. (Minimal action in Euclidean 
space means maximal tunnelling rate). This will yield a one-parameter parametrization of 
bubbles; the parameter is a collective coordinate X(t), and having found the solution, we 
can then compare our ansatz in (11.15) and see how good the ansatz was. Mathematically, 
we can formulate this problem as a variational problem with a constraint. Introducing a 
constant Lagrange multiplier a we consider the action for the variational problem 

C = 4(d*tf-US-£) 2 -B(W-£<p 



The equation of motion 



4 v a; V3 a 



ddC 9£ =0, (11.27) 



dx dip x dip 
has a first integral due to equipartition of energy 



(11.28) 



The integration constant vanishes for bubbles. Introducing a field if which vanishes for 
large x 

y5 = V? + ^, y? = ^3--^, (11.29) 
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we obtain 

dm f\ I / 9n\ 2 ZLR /1 // \ 9rv 

(11.30) 



dip 
dx 




For a = B = the solution is the kink, but for a ^ we get bubbles. One can actually 
solve this equation exactly by using (see Gradhstein and Resznik, page 84, 2.266) 

dy 1 , arc cosh ^ , (11.31) 



y\Ja + by + cy 2 Va y^f-Aac ' 

which holds if a > and 6 2 — 4ac > 0. This corresponds to < a < 2/i 2 . By writing the 
differential equation as 




dip 



2a 
A 



we obtain for the bubble with fixed size and minimum action 

2a + b(p 
(p\/Aac — b 2 

2a Tx(V-a) 



;n.32) 



cosh 




V^ac - b 2 cosh (v^y^a: - Xoj - b V® cosh(^(x - x )) + 2/i 

(11.33) 

The constant a lies in the domain < a < 2/i 2 . For a = we find y3 = ^= ^or = ^ j 

while for a = 2/i 2 we find <p> — ^or </; = — In between, we have bubbles of finite 

extent; for small a the function ip remains constant for a long time (the bubble) and then 
it falls rapidly off to zero (due to the cosh). This is the same behaviour as displayed by 
our ansatz in (11.15). 



12 The strong CP problem 

The vacua \n > of Yang-Mills theory in Minkowski space with winding number n all have 
the same energy, namely zero (because they are vacua). We recall that at fixed time space 
was compactified to an S* 3 which was mapped to the S 3 of the group manifold of SU(2). 
Since there is tunnelling as we have discussed, the physical vacuum is a linear combination 
of all of them. Since they all appear on equal footing, we expect that the generator T for 
large gauge transformations which change the winding number, defined by T\n >= \n+l >, 
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commutes with the Hamiltonian. Hence T maps the physical vacuum into itself. It follows 
that T|vac >= e l(f \yac > with (p some phase. The solution of this equation is 



| vac >=| 9 >= e \n > ■ (12.1) 

n 

Indeed T\9 >= E e ine | n + 1 >= e~ ie \ 9 >. 

Instead of using the infinite set of states in (12.1), we can work with the ordinary vacuum 
| > if at the same time we add a term 

g 2 

£e = —Qqcd iq jt 2 ^ 1c ^ u (12-2) 

to the action. This term yields a factor e md in the action e^ s if one is in the vacuum with 
winding number n. We shall set h — 1. Note that we are in Minkowski space and that Cq 
is hermitian. 

Strictly speaking, we should first make a Wick rotation to Euclidean space because we 
can only discuss instantons in Euclidean space, but Cq has the same form in Euclidean 
space: one gets a factor i from d 4 x and another factor i from F 0i . Together with the 
factor jr in e^ s one gets the same factor e md in Euclidean space. The ^-term is a total 
derivative, and usually one discards total derivatives in Lagrangians because fields vanish 
at infinity, but for instanton backgrounds one finds of course a nonvanishing contribution 
due to winding. 

The 6>-term clearly violates parity P. It conserves charge conjugation symmetry C, hence 
it violates CP. The strong interactions described by QCD are not supposed to violate P or 
PC, hence #qcd should be very small. However, the observed 9 parameter contains more 
than only 6*qcd- There is a second origin for a 0- angle coming from the electroweak sector: 
the manipulations leading to the CKM matrix. Recall that the mass terms of the quarks 
in the Standard Model come from Yukawa couplings 



m,n 



g(qu) 




+g'{qu) mn I ^ \ I '"n I ?kn + hx - ' ( 12 - 3 ) 



Ql. 



rn 



where g(qu) are the Yukawa couplings to quarks, and h + , h° are the two components of the 
complex SU(2) Higgs doublet. Furthermore m — 1,2,3 labels the families, so qi denotes 
the up quark while q[ denotes the down quark. When h° gets a vacuum expectation value 
< h° >= -^v, one obtains mass matrices M for the (w, c, t) quarks and M' for the (d, s, b) 
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(12.4) 

One diagonalizes them 
handed quarks 45 

(12.5) 

The mass matrix for the quarks becomes then diagonal with real masses 

q L , m M mn q Rn = {U L q L )D{U R q R ) (12.6) 

and similarly for q^m-^mn^Rn- So, the physical quarks are Ql = UiqL and Qr = UnqR, 
and similarly for Q' L and Q' R . 

If one rescales q^ to Ql, and q R to Qr, three things happen 

(i) the quark mass terms are diagonalized as we have discussed, yielding real physical quark 

masses 

(ii) a phase 5 appears in the CKM matrix which describes electroweak CP violation 

(iii) a new term is produced in the action by the Jacobian for these chiral rescalings. This 
new term is again proportional to f F^F^d^x with a coefficient which we call — #ew- 
Hence, now the action contains the sum 9 = 9qcd + #ew 

Cb = -(Oqcb + Oew)^~ 2 J {F^Fl^x . (12.7) 

There is no reason that 9 stTOQg = 9q CD + 6> EW vanishes, yet, as we now discuss, this seems 
to be the case. 

We can make a final chiral rescaling of the 3 light quarks (the u, d and s quarks) such 
that the ^-term is entirely removed. Rescaling the left-handed quarks by U(l) factors 
e vp u ^ e i Vd anc j e up 3 ^ j aco bians for these rescalings yield a term 

~(<Pu + ¥d + <Ps) ^2 tr F nv* F iw , (12.8) 

which cancels the 6*-term if ip u + ifd + tp s = 9 strong . Because the action is invariant except 
for the mass terms, only the transformation of the mass terms yields a new term in the 
action. In the diagonal mass term 

m u uu + m d dd + m s ss , (12.9) 

45 A complex matrix M can always be written as VH where V is unitary and H hermitian. This is the 
generalization to matrices of the decomposition z = e lv p of complex numbers. Then H can be diagonalized 
by a unitary matrix, H — UrDU^ 1 , and Ul is given by VUr. 



quarks, where 



M mn = —Qmn , M' mn = —=h 



These matrices are in general arbitrary complex 3x3 matrices, 
with 3x3 unitary matrices which are different for left- and right- 

UlMUr 1 = diag(m u ,m c ,m t ) = D , 
U' L M'U' R X = diag(m d , m s ,m b ) = D' . 
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the rescalings yield, to first order in ip u , ip d , ip s , a new term in the action 

£cp violation = iip u m u uj 5 u + i(p d m d dj 5 d + i(p s m s s^ 5 s . (12.10) 



The </?'s are only constrained by ip u + ip d + ip s = strong , so we can still choose them such 

8m d m s 
m u m d +m u m a +m d m s " 



that this new term is SU(3)v invariant. Namely if (p u = Sm d m a an( ^ C y C ij ca ny 



for ip d and <p s , then 

£cp violation = ; ; {UI5U + drisd + S75S) . (12.11) 

m u m d + m u m s + m d m s 

This term is hermitian and SU(3)y invariant, but it violates P, and since it conserves C, 
it also violates CP. The original #-term in the action in (12.2) has been transformed into 
the masslike terms in (12.11). No longer does one have to deal with total derivatives, but 
an ordinary extra masslike term has appeared in the QCD action. There is no reason that 
Strong should be small, but one can compute the electric dipole moment of the neutron 
which is nonzero if Strong is nonzero, and since experimentally the electric dipole moment 
has a very small upper bound, one finds that Strong is incredibly small 

Strong < 10- 9 . (12.12) 

The problem why # str ong is so small is called the strong CP problem. Note that it has 
nothing to do with the CP violation due to the phase 5 in the CKM matrix, which is an 
electroweak effect. Also the electroweak CP violation is very small; it can be parametrized 
by the area of the unitarity triangles (each of the 6 unitarity triangles has the same area 
2J in the Standard Model) 

J = (3.0 ± 0.3)10~ 5 . (12.13) 



13 The U(l) problem 

In this section we discuss an application of instantons in QCD. 

In the 1960's, in the absence of a renormalizable theory of the strong interactions, 
current algebra was developed as a method to derive information about matrix elements 
of currents, mostly the vector and axial-vector Noether currents which correspond to the 
(approximate) rigid flavor symmetry of the up, down and strange quarks. In terms of 
modern QCD, the action for the strong interactions reads 

£ = -|(^J 2 -£^W\ (i3.i) 

where % = l,...,Nf labels the flavors. One can consider either two very light quarks (u 
and d), or three rather light quarks (u, d and s). Decomposing the massless quarks into 
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left-handed and right-handed parts, their action becomes 

£(quarks) = ^ L M ~ $i,R Mr ■ (13.2) 

It has clearly a rigid Ui(Nf) x Un(Nf) symmetry group, where Ul acts only on ip L and Ur 
only on ijj l R . Instead of Ul and Ur we consider the vector part Uy(Nf) and the axial vector 
part UA{Nf). The vector part transforms ip L and ip R the same way, while they transform 
oppositely under U a- The total number of symmetries and group parameters has not 
changed, but physically Uy and U a are very different. The SU(2)v part of the symmetry 
is realized in Nature, and yields the SU(2) classification scheme for quark hadroscopy. 
The U(l)v corresponds to baryon-number conservation which is also (very well) satisfied. 
The SU (2) a symmetry is spontaneously broken, and the corresponding Goldstone bosons 
form an SU(2) multiplet of pseudoscalars (the pions and the 77 meson). One might be 
inclined to apply the same reasoning to the U(1)a symmetry, and argue that it, too, must 
be spontaneously broken because there is no doubling of multiplets with opposite parity 
observed in nature. However, the U(1)a symmetry is explicitly violated by the presence of 
instantons in QCD, leading to the instanton-induced six-fermion interaction in the effective 
action. This solves "the U(l) problem" that no isoscalar Goldstone boson exists in Nature 
[82]. There is a pseudoscalar meson, the 77 with a mass of 478 MeV. It cannot be the 
Goldstone boson because from current algebra S. Weinberg has shown that the mass of 
such a Goldstone boson has to be smaller than ^m^, far below the mass of the rj meson 
[83]. (The 77 meson can still be made of a quark and an antiquark, so the usual SU(2) 
scheme is still applicable - only this 77 meson is not a Goldstone boson) 46 . 

The axial-vector isoscalar current associated with the U A iX) symmetry is j 5 = £V ^ilhl^ % '■ 
It has an Adler-Bell-Jackiw chiral anomaly 

d'jf = N f ^- 2 e^F^F; a , (13.3) 

where F" denotes the field strengths of the gluons, and g% is the QCD coupling constant. 
In a QCD instanton background, integration over spacetime yields 

J ^Q 5 dt = N f k , (13.4) 

where k is the winding number. To make sense of this equation one should first integrate 
in Euclidean space to obtain a non- vanishing expression for the right -hand side in terms of 
the winding number k of the QCD instanton, and then Wick-rotate so that the left-hand 
side can be written as J d 4 xd^j^ oc J ^.Q 5 dt. The conclusion is that Q 5 is not conserved 
because k can be different from zero. For further discussion of the U(l) problem we refer 
to [82], [84, 85], or to the lecture notes by Coleman in [2]. 

46 0ne can extend this discussion to Ul(3) x Ur(3) with pions, kaons and 77 now 8 Goldstone bosons, 
and the rj' with mass 958MeV taking the place of i]. This 77' is an SU(3) singlet. 
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14 Baryon decay 



In this section we present an application of instantons to the gauge fields of the electroweak 
sector of the Standard Model. 

In an instanton background with winding number k, massless (or approximately mass- 
less) fermions in the fundamental representation of SU(N) have \k\ zero modes, see (4.37). 
In the electroweak SU(2) W x U(l) theory (the subscript w stands for weak), quarks and 
leptons are in the fundamental representation (doublets) of SU(2) W . In Euclidean space 
the integration over zero modes of these quarks and leptons has dynamical consequences 
which we shall derive, but of course real quarks and leptons live in Minkowski space and not 
in Euclidean space. We assume that the Green functions in Minkowski spacetime can be 
obtained from those in Euclidean space by analytic continuation. Ideally we should prove 
that the Euclidean results give the main contribution to processes in Minkowski space in 
the same way as this was shown for tunnelling, but as far as we know this has not been done. 
Since processes involving electroweak instantons are suppressed by a factor exp ^— ^^-\k\j 
with g 2 the electroweak 577(2) coupling constant, we only consider instantons with \k\ = 1 
made from W + ,W~ and W° bosons. Then the left-handed quark doublets Q 1 ,) and ( s c ,) 
each have 3 zero modes because there are 3 colors, while the lepton doublets (g_) L and 
(^-) L each have one zero mode. The primes on d! and s' denote Cabibbo-rotated quarks 

d! — d cos 9 C + s sin 9 C 

s' = s cos 8 C — d sin 9 C (14.1) 

with 6 C = 13° the Cabibbo angle. As we shall explain, this Cabibbo rotation makes it 
possible for a neutron and a proton (six quarks together) to decay into two antileptons [5] 

p + n -> e + + (or /i + + v e ) . (14.2) 

In these instanton- induced processes, the electron number E, muon number M, up plus 
down number, and charm plus strangeness number change as follows 

AE = AM = 1 , Au + Ad' = 3, Ac + As' = 3. (14.3) 

The decay of a proton with (u, u, d) and neutron with (u, d, d) quarks into e + and u^, or 
into /i + and u e , can be described by a local vertex operator with 3 up-quark fields with 
different colors from Q 1 ,) doublets, and 3 down-quark fields also with different colors from 
( s c ,) doublets, and further one field from each of the two lepton doublets. This operator is of 
course nonrenormalizable, but it can be used in effective field theories for phenomenological 
purposes. Although this efective operator is derived from field theory in the sector with an 
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instanton, once it is obtained one can add it to the effective action and then forget about 
the existence of instantons. We now derive these results. 

The U(1)a symmetry has at the perturbative level an anomaly. There are triangle 
graphs with an anomaly: one vertex of the triangle graph is given by ffi = ^ s V ,s 757 /i ?/> s 
(where s = 1, .. . , Nf and Nf is the number of flavors, 3 in our case if we restrict our 
attention to the lightest quarks u, d and s). The one-loop perturbative chiral anomaly is 
then given by 

^J'i 5) = iNf^G^G% , (14.4) 

where G a ^ v is the W-boson field strength and gi the coupling constant of the SU (2) weak 
interactions. (This is thus the abelian flavor U(1)a anomaly. The nonabelian anomaly for 
the rigid flavor group vanishes because it is proportional to the trace of T a of the flavor 
group, which vanishes). 

If one integrates over space and time, the anomaly equation becomes 

/ i Qi5) ^I dt ij d3x{tfo) = 2Nfk ■ (145) 

The instanton number k counts the number of left-handed fermions minus the number 
of right-handed fermions, and in ordinary perturbation theory (with k — 0) for massless 
quarks, this difference is thus conserved. However, in an instanton background (k ^ 0), the 
chiral charge of the vacuum at t = — oo changes to a different chiral charge of the vacuum 
at t — +oo: AQ^ = 2Nfk. The conclusion is that the perturbative anomaly, and the 
violation of the axial charge which occurs when one tunnels from one vacuum to another, 
are related! Both are different aspects of the same chiral anomaly. The perturbative 
anomaly occurs when fields are small, so the winding vanishes and one is in the k — 
sector. The nonperturbative anomaly is due to the same axial-vector current but now 
in the background of instantons which cannot be viewed as small and tending to zero at 
infinity, since they must produce winding. 

One may at this point wonder whether the Higgs effect which gives the H^-bosons a 
mass, in such a way that they vanish exponentially at large distances, does at the same 
time destroy the concept of winding. There is no contradiction. When we discussed the 
large instanton problem, we chose the regular gauge for the instanton to simplify the 
calculations. However, exponential fall-off only occurs in the singular gauge. In that case, 
the winding takes place at the origin, as we discussed in the introduction. In this section 
we use the regular gauge and then there is winding at infinity even in Higgs models. 

To saturate the integrations over the Grassmann collective coordinates, one needs 6 chi- 
ral quark fields in a correlator (one for each zero mode). Each field has a mode expansion 
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into a zero mode and all nonzero modes, but the integration measure d/C over the Grass- 
mann variable /C in the mode expansion picks out only the zero mode. Then the integration 
over collective coordinates gives as result the product of the 6 zero mode functions. If we 
put one SU(2) W doublet with one up quark and one down quark at a point x±, a second 
pair a x 2 , and a third pair at £3, 47 and the instanton is at x , we find from (5.35) for large 
separations (x 2 » p 2 ) the factor 48 

His^r <146) 

So if one computes some correlator in a theory with instantons, six quark fields from the 
correlator are needed to saturate the Grassmann integrals, and the remaining fields are then 
treated as in ordinary field theory (with propagators and vertices). Thus instantons induce 
a term proportional to Yli=i (x -xof * n ^ ne effective action which describes the annihilation 
of 6 quarks. Further there are a matrices and other constants which are also due to the 
zero mode function. 

One can now construct an effective local 6-quark vertex V at a point x which yields 
the same results in a theory without instantons as one obtains in a theory with instantons 
if one integrates over the fermionic collective coordinates of the quarks. This vertex must 
contain 6 quark fields which contain the 6 different collective coordinates, hence it has the 
form V = u^' 1 u^ 2 u^ 3 d S £ 1 d^ 2 (f£ 3 T a i3 7 g e ^ where T is a numerical tensor. Contraction of 6 
"probe-quarks" at positions xi,X2,x$ with V at xq using ordinary flat space propagators 
(a .-so) 3 f° r mass l ess quarks in a trivial vacuum precisely reproduces the result for the 
correlation function of the 6 probe-quarks in an instanton background centered around xq, 
provided the form of T is correctly chosen. 

These new vertices lead to anomalies in the baryon currents and lepton currents. In 
particular, the rigid U(1)a symmetry is explicitly broken by the presence of the interaction 
V in the action, and as we discussed in the previous section, this solves the U(1)a broken. 
As we already mentioned, a proton and a neutron (two baryons equal six quarks) may 
annihilate to form two antileptons (an e + or a p + , and an anti-neutrino). However, due to 

47 A massless complex Dirac spinor contains two Weyl spinors which are decoupled from each other 
ipDpxpD — ipLpi^L + i^Kp'tpR- Each has a zero mode. However, since only left-handed quarks couple 
to the W gauge fields, only left-handed quarks feel the presence of instantons, and so we neglect the 
right-handed quarks in this discussion. 

48 The down quark is contained in the s' of the doublet (c, s'). This is an SU(2) W doublet, and the 
instanton is an SU(2) W instanton. Although the c quark is heavier than the s quark, one can still view 
them as massless compared to the scale 250GeV of electroweak interactions. Massive spinors in an instanton 
background have no zero modes, as one may show by adding a mass term to (4.17) and (4.18). We assume 
that for such a broken SU(2) W doublet there still exists approximately a zero mode. 
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the incredibly small prefactor exp ^— ^jr\k\ j , where g 2 is the SU (2) weak coupling constant, 
these processes are not observable. 

15 Discussion 

In this chapter we have reviewed the general properties of single Yang-Mills instantons, 
and have given tools to compute non-perturbative effects in (non-) supersymmetric gauge 
theories. However, we have not discussed several other important or interesting topics: 

• Perturbation theory around the instanton: the methods described here enable us to 
compute non-perturbative effects in the semi-classical approximation where the coupling 
constant is small. It is in many cases important to go beyond this limit, and to study sub- 
leading corrections that arise from higher order perturbation theory around the instanton 
[58, 57]. Apart from a brief discussion about the one-loop determinants in section 7, we 
have not really addressed these issues. 

• Multi-instantons: we have completely omitted a discussion of multi-instantons. These 
can be constructed using the ADHM formalism [31]. The main difficulty lies in the explicit 
construction of the collective coordinates in an instanton solution and of the measure of 
collective coordinates beyond instanton number k — 2. However, it was demonstrated that 
certain simplifications occur in the large N limit of M = 4 SYM theories [25] , where one can 
actually sum over all multi-instantons to get exact results for certain correlation functions. 
For reviews on the ADHM construction in super Yang-Mills theories, see e.g. [32, 25, 86]. 
The same techniques were later applied for M = 2, 1 SYM [87, 88], and it would be in- 
teresting to study the consequences of multi-instantons for large N non-supersymmetric 
theories. For a review on instantons in QCD, see for instance [89]. 



A Winding number 

For a gauge field configuration with finite classical gauge action the field strength must 
tend to zero faster than x~ 2 at large x. For vanishing F^ V) the potential becomes then 
pure gauge, — U~ l d^U . All configurations of A^ which become pure gauge at infinity 
fall into equivalence classes, where each class has a definite winding number. As we now 
show, this winding number is given by 

where *F tlu = \e^ Uf „F p(7 and T a are the generators in the fundamental representation of 
SU(N) , antihermitean NxN matrices satisfying tr T a Tb = — \5 a b- This is the normalization 
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we adopt for the fundamental representation. The key observation is that *F pv F pu is a 
total derivative of a gauge variant current 49 

tr*F = 29 M tr e^ vpc {A u d p A a + ^A u A p A a } . (A. 2) 

According to Stokes' theorem, the four-dimensional space integral becomes an integral over 
the three-dimensional boundary at infinity if one uses the regular gauge in which there are 
no singularities at the orgin. Since F pu vanishes at large x, one may replace d p A a by 
—A p A a , and since A p becomes a pure gauge at large x, one obtains 

k = ^ 2 l d^ W tr { {U- l d u U) {U- l d p U) (U'^U) } , (A.3) 

Z47T ,/ £3 ( spacc ) 

where the integration is over a large three-sphere, S 3 (space), in four-dimensional Euclidean 
space. To each point x^ on this large three-sphere in space corresponds a group element U 
in the gauge group G. If G — SU (2), the group manifold is also a three-sphere 50 S 3 (group). 
Then U(x) maps S^space) into S* 3 (group), 51 and as we now show, k is an integer which 
counts how many times S* 3 (space) is wrapped around S* 3 (group). Choose a parametrization 
of the group elements of SU(2) in terms of group parameters 52 (i = 1,2,3). Then 

the functions C( x ) ma P x hito SU(2). Consider a small surface element of S 3 (space). 
According to the chain rule 

tr {{U- l d v U) (U-%U) {U- l d a U)} 

<K ' ° iJ ' :>e tr {(U~%U) {U-'djU) (U~'d k U)} , (A.4) 



dx v dx p dx a 



and using 
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AQ p = \e lxaPl Ax a AxpAx 1 , (A. 5) 



49 Note that *F lLV F llv is equal to 2e^ pa - {d^A^dpAa + 2d p A u A p A a - + A p A„A p A a } but the last term 
vanishes in the trace due to the cyclicity of the trace. 

50 The elements of SU(2) can be written in the fundamental representation as U = ao^+iJ2k a k T k where 
Tk are the Pauli matrices and ao and a,k are real coefficients satisfying the condition ag + Y^k a 1 — 1- This 
defines a sphere S' 3 (group). (If the a's are not real but carry a common phase, one obtains the elements 
of U(2)). 

51 There is actually a complication. Far away A^ — U~ 1 d p U but in order that U be only a function on 
S' 3 (space) it should only depend on the 3 polar angles but not on the radius. Hence A r = U~ 1 d r U should 
vanish. We can make a gauge transformation with a group clement V such that A' r = V^ 1 (d r + A r )V 
vanishes. The V which achieves this is the path ordered integral along the radius from the origin, V = 
Pexp — J r A r dr. Note that U is only defined for large r, but V must be defined everywhere, and V ^ U. 
In fact, V does not have winding since it can be continuously deformed to the unit group element. The 
winding number is computed in the text for UV, but since k in (A.l) is gauge invariant, k is also the 
winding number of the original gauge field A p . 

52 For example, Eulcr angles, or Lie parameters U = clq \ + i ^ k akTk with ao = \J\ — 'J2 k a|. 

53 For example, if the surface element points in the x-direction we have Ail = AyAzAr if £1234 = 1. 
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with le^pae^a^ = and A^A^A^ = e ijk A 3 £, we obtain for the contribution Ak of 
the small surface element to k 

Ak = ^e^tr { (U-%11) (U^djU) (U~ x d h U) } A 3 £ , (A.6) 

where k = j>g3( space ) Ak. The elements (Z7 — 1 (^)^Z7 (^)) lie in the Lie algebra, and define the 
group vielbein e°(£) by 

{U~ l d % U) = e a M)T a ■ (A.7) 

With e tjk e'^e b j el = (det e) e abc , we obtain for the contribution to k from a surface element 
AQ^ 

Ak = -^- 2 (det e) tr {e abc T a T b T c ) A 3 £ = - (det e) A 3 £ . (A.8) 

We used that for SU (2) we have [T a , = e a b c T c . As we have demonstrated, the original 
integral over the physical space is reduced to one over the group with measure (det e) d 3 £. 
The volume of a surface element of S* 3 (group) with coordinates d£* is proportional to 
(det e) d 3 £ (called the Haar measure). Since this expression is a scalar in general relativity, 54 
we know that the value of the volume does not depend on which coordinates one uses except 
for an overall normalization. We fix this overall normalization of the group volume such 
that near £ = the volume is A 3 £. Since ef = 6? near £ = 0, we have there the usual 
Euclidean measure d 3 £. Each small patch on S 3 (space) corresponds to a small patch on 
S" 3 (group), Ak ~ Vol(A 3 £). Since the C/'s fall into homotopy classes, integrating once over 
S" 3 (space) we cover S 3 (group) an integer number of times. To check the proportionality 
factor in Ak ~ Vol (A 3 £), we consider the fundamental map 

U(x) = ix^a^/Vx* , U' 1 (x) = -ix^a^/Vx* . (A.9) 

where <r M denotes the 2x2 matrices (a, i) with & the Pauli matrices, and = (a, —i). 
This is clearly a one-to-one map from S* 3 (space) to S 3 (group) and should therefore yield 
| A; | = I. Direct calculation gives 

U~%U + = " Wv/x 2 , (A.10) 

where a^ u is defined in (B.8). Substitution into (A. 3) leads to k — — ^ § di^ x^/x 4 = — 1 
making use of (B.18). 55 To obtain k — 1 one has to make the change a <-> a or x <-> — x in 
Eq. (A.9). 



54 Under achange of coordinates £ = £(£') at the point £, the vielbein transforms as e"(£) = %rre'j 1 (£(£'))> 
hence det e(£) = ^det det (e' (£')), while d 3 £ is equal to | det J|v|d 3 £'. For small coordinate transfor- 
mations detd£/d£' is positive, hence dete d 3 £ is invariant. 

55 0nly the commutator of the first two matrices in tr (a^ a a p pa ai )x a x l3 x 1 contributes because the anti- 
commutator is proportional to the unit matrix. In the result only the anticommutator gives a nonvanishing 
result, because the commutator yields term proportional to a a p whose trace vanishes. 
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Let us comment on the origin of the winding number of the instanton in the singular 
gauge. In this case A^ mg vanishes fast at infinity, but becomes pure gauge near x — 0. In 
the region between a small sphere in the vicinity of x = and a large sphere at x = oo 
we have an expression for k in terms of a total derivative, but now for A^ mg the only 
contribution to the topological charge comes from the boundary near x = 0: 



The extra minus sign is due to the fact that the normal to the S 3 (space) at x = points 
inward. Furthermore, A s ^ lg ~ U~ l d p U = —a^Xy/x 2 near x = 0, while A™ g ~ Ud^U^ 1 = 
—a pu x u /x 2 for x ~ oo. There is a second extra minus sign in the evaluation of k from 
the trace of Lorentz generators. As a result k s - mg = k reg , as it should be since A; is a gauge 
invariant object. The gauge transformation which maps A™ g to A s ^ lg transfers the winding 
from a large to a small S* 3 (space). 

B 't Hooft symbols and Euclidean spinors 

In this appendix we give a list of conventions and formulae useful for instanton calculus. 
Let us first discuss the structure of Lorentz algebra so(3, 1) in Minkowski space-time. The 
generators can be represented by L^ u = \{x^d u — Xyd^) and form the algebra [L^, L pa ] = 
-V^p L ua-VuaL^ p +r]^L up + r] up L^ a , with the signature r] pu = diag(-, +, +, +). The spatial 
rotations Jj = \eij k Lj k and boosts Ki = L 0i satisfy the algebra [Jj, Jj] = —eij k J k , [Jj, Kj] = 
[Ki, Jj] = —tijkK k and [Ki, Kj] = e ijk J k . 

There exist two 2-component spinor representations, which we denote by A" and Xa 
(a = 1,2 and a = 1,2). The generators for these spinor representations are a^ v and a^ v , 
where a pv = Kcr^ - a u a^), a pu = \(a p a u - a v a^), with a'f = (r, I), a^^p = (f, -I),H = 
1, 2, 3, 0, and / denotes the identity matrix. The matrices r l with % — 1, 2, 3 are the usual 
Pauli matrices. They consist of a tj = ie tjk r k and a ' 1 = t % for A", and a tj = ie tjk r k and 
a° l = —r % for Xa- The rotation generators a lj are clearly antihermitian, but the boost 
generators are hermitian. 

Under a rotation or boost, both spinors simultaneously transform. Most importantly, the 
two spinor representations are complex. In fact, they are each other's complex conjugate 
up to a similarity transformation: (er^)* = o^cr^o^. The matrices ir k and r k form the 
2x2 defining representation of the group S7(2, C), which is the covering group of SO(3, 1). 

The situation differs for Euclidean space (5^ = diag (+,+,+, +)) with 5*0(4) in- 
stead of the Lorentz group SO(3, 1). Now [L^, L pa ] = S^pL^ + 3 terms, and [Jj, Jj] = 
-Cij k J k ,[Ji,Kj\ = -e ijk K k but [Ki,Kj] = -e ijk J k where obviously Jj = \e ijk L jk and 




(A.11) 
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boosts Ki = Lj 4 . The linear combinations of (ij) and (4, i)-plane rotations 

Ml = ^(J l + K t ), Nl = ^(J z -K t ), (B.l) 

give the algebras of commuting SU (2) subgroups of 5*0(4) = SU (2) x SU (2) in view of the 
anti-hermiticity Ml — —Mi, Nj — — iVj. We now denote the two spinor representations 
by \ a and Xa'- Because M and iV are represented by generators ia M and icrjy which 
act in different spaces, one can transform \ a while Xa' stays fixed, or vice versa. The 
two spinor representations in Euclidean space are each pseudoreal: as we shall discuss 
(<7f")* = a 2 a^a 2 and (a^)* = a 2 a^ u a 2 . 

It is an easy exercise to check that we can represent the operators M and N by 

Mi = fji^ , and N { = r) iiXV , (B.2) 

where we introduced 't Hooft symbols [4] 

Vapu = ^apv $ap$vA ^av^Ap i Vaij ^aiji VajA &aj 
Vapu = ^apu $ap$vA + ^av&Api or Vaij = ^aiji VajA = $aj 

and fj afll/ = (—l) S4 * M+S4 "Vap,v They form a basis of anti-symmetric 4 by 4 matrices and are 
(anti-) self dual in vector indices (ei 2 34 = 1) 

Vapu 2 ^pupaVapa j Vapu 2 ^pupaVapa • 

The ?7-symbols obey the following relations 

^abcVbpuVcpa ^ ppVaua ^uaVapp $ paVaup ^upVapa j 

VapuVapa $ pp^ucr $ pcr^up ^pupa i 

VappVbpa 3ab&pcr ~\~ €-abcVcpa i 

tpuprVaar ^apVaup ^crpVapu ^auVapp j 

VapuVapu = 12 , VapuVbpu = 45 a fe , VappVapa = 35 pa . 



The same holds for 77 except for the terms with e 



pupcn 



VapuVapa $pp$ua $pa$up ^pupa i 

EpupaVaar $apVaup $apVapu ~\~ $auVapp ■ (^"^) 

Obviously VapuVbpu = due to different duality properties. In matrix notation, we have 

[Va,Vb] = -ZtabcVc , [V~a,V~b\ = ~^abcVc , 

{Va,Vb} = -2<5 a & , {Va,Vb} = -25 ab , (B.7) 
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and the two sets of matrices commute, i.e. [r) a ,fjb\ = (this is equivalent to the statement 
that the generators M and N commute). 

The two inequivalent spinor representations of the Euclidean Lorentz algebra are given 

by 

cr^v = \Wn&v - cv#>] , &nv — |[tfy?V - a v a^\ , (B.8) 
in terms of Euclidean matrices 

af = (r a ,i), <V^ = (r a ,-i) , // = 1,2,3,4, (B.9) 

obeying the Clifford algebra + a u a^ = 25 Since contains a^ = €ijkir k and 
o"i4 = — «Tj, while contains a^ = €ijkir k and a a = they are not each others complex 
conjugate, contrary to the Minkowski case. Rather, they are pseudo-real, meaning that 
their complex-conjugates are related to themselves by a similarily transformation 

G *ixv — °2£V0"2; = G2°nvV2 ■ (B.10) 

To prove these, and other, spinor relations, one needs some formulas which we now 
present. As in Minkowski space, also in Euclidean space a^ and a^ are related by trans- 
position 

_ act' - a' a (tj i i \ 

where is obtained from a^pip by raising indices 

- a' a a' 13' a/3 -p. fr> in\ 

= e 1 e 1 a^ //3 (B.12J 

We use everywhere the north-west convention for raising and lowering the spinor indices 

e a % = C , = & , (B.13) 

with e a/ 3 = —e a 'f3',e a f3 = e a/3 , and e a ipi = e a 'P'. However, the relation between a^ and 
a ft under complex conjugation is different (as expected because a = I but a 4 = il). In 
Minkowski space we have (a^)* = a^ a , while in Euclidean space {<jff')* = a^p'a = ^n, a p' 

and (a^T = *f = <^ 

Let us now apply these formulas to give another proof that a^ and a^ are pseudoreal 
in Euclidean space 

1 -B'B 1 & '- B 

= -e 7Q (o-^) 7 (5 e /3<5 = (-ia 2 )(-af l „)(-ia 2 ) = <7 2 <V<7 2 ( B - 14 ) 
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and idem for o pv . 

The two spinor and vector representations of the su(2) algebra are all given in terms of 
anti-hermitian 2x2 matrices a pU) a pv and ir a and they are related by the 't Hooft symbols, 

= iVa^r a , a ya, = ifj apu r a . (B.15) 

Furthermore, a pv is selfdual whereas o pv is anti-selfdual. Some frequently used identities 
are 

p®vp &iiv® p 3/j,p&u ^pvpo® a j " pPvp $ pv® p ^ pp@v tpvptr&cr j 
<Jpv<Jp = $up<Jp $pp&v 4" tpupa&cr i &pu&p = $up&p $pp&u Epupa&a ■ (B.16) 

The Lorentz generators are antisymmetric in vector and symmetric in spinor indices 

<?puaf3 = —Cupa/3 , & pv afi = & pu /3a , (B-17) 

and obey the algebra 

[&pui "per] 2 {5 ppO~ ua -\- 5u<j&pp ^pa^vp ^up^pa} j 

Wpu, &pa} = -2 {SppSua - Spabvp ~ tpvpa} ■ (B.18) 

The same relations hold for a pu but with +e pupa . In spinor algebra the following contrac- 
tions are frequently used 

of&ppp = 25/6 / , = 4 [SpV ~ 2 W} • (B.19) 

so that Cfi)C(2)a = C( a 2 )C(i)a- For hermitean conjugation we define ^fi)C(2)a) = (C(2)a) t (C ( Q i ) ) t 

of )* = {a» a , p y = af . (B.20) 

Throughout the paper we frequently use the following integral formula 

frT 2 / 2\n-m+2 + 2)T(m - W - 2) 

(ar + p l ) ' r[m) 

which converges for m — n > 2. 

C The volume of the gauge orientation moduli space 

The purpose of this appendix 56 is to prove equation (6.15). Let us consider an instanton 
in SU (N) gauge theory. Deformations of this configuration which are still self-dual and 



56 We thank R. Roiban for help in writing this appendix. 
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not a gauge transformation are parametrized by collective coordinates. Constant gauge 
transformations — > U~ 1 A^U preserve self-duality and transversality but not all constant 
SU (N) matrices U change A^. Those U which keep A^ fixed form the stability subgroup H 
of the instanton, hence we want to determine the volume of the coset space SU{N)/H. If 
the instanton is embedded in the lower-right 2x2 submatrix of the N x N SU (N) matrix, 
then H contains the SU (N — 2) subgroup in the left-upper part, and a U(l) subgroup with 
elements exp {OA) where A is the diagonal matrix 



„ i N-2 , / 2 2 \ t „ s 

A= 5V— dia H^'""^ ,M ) ■ (ai) 

All generators of SU (N) (and also all generators of SO{N) discussed below) are normalized 
according to tr T a Tb = — |5 a b, as in the main text. 

At first sight one might expect the range of to be such that the exponents of all entries 
cover the range 2n an integer number of times. However, this is incorrect: only for the 
last two entries of exp {OA) we must require periodicity, because whatever happens in the 
other N — 2 diagonal entries is already contained in the SU{N — 2) part of the stability 
subgroup. Thus all elements h in H are of the form [52] 



h = e 9A g, with geSU{N-2) and < < max = An^j -^-^ . (C.2) 

For N = 3 the range of is larger than required by periodicity of the first iV — 2 entries, for 
N = 4 it corresponds to periodicity of all entries, but for N > 5 the range of is less than 
required for periodicity of the first N-2 entries. 57 Thus H ^ SU{N) x U{\) for N >h. 
The first N — 2 entries of exp {kO m & x A) with integer k are given by exp {—ik-^z^} an d lie 
therefore in the center Z N of SU{N — 2). So, the SU{N) group elements h = exp {OA) g 
with < < max and g in SU{N — 2) form a subgroup H. We shall denote H by 
SU{N - 2) x "[/(l)" where "[/(l)" denotes the part of the U(l) generated by A which lies 
in H. We now use three theorems to evaluate the volume of SU{N)/H: 

SU{N) = Vol {SU{N)/SU{N-2)) 

1 ' SU{N - 2) x "[/(l)" Vol "[/(l)" 

Vo1 SU{N-2) = Vo1 ^(iV-l) Vo1 SU{N-2) ' (C - 3) 



57 For example, consider SU{5) with exp[y y| diag (-§,-§, — |, 1, 1)]- When runs from to Y§47r, 
last two entries repeat, but the first three entries only reach exp(— 47u/3). The first three entries form then 
an element of SU(N — 2) = SU{3), namely they yield an clement z of the center Z 3 . So when ranges 
beyond y / ^§47r, these SU{5) elements can be written as a product of z and expi0^4 with 9 smaller than 



|47r. So, the range of is bounded by \/§47r. 
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(Ill) Vol 



SU(N) 



Vol SU(N) 



SU(N-l) VolSU(N-l) ' 

It is, in fact, easiest to first compute Vol (SU(N)/SU(N — 1)) and then to use this result 
for the evaluation of Vol SU(N)/H (with Vol SU(N) as a bonus). 

In general the volume of a coset manifold G/H is given by V = jW^dx 11 det e™{x) 
where are the coordinates on the coset manifold and e™(x) are the coset vielbeins. 
One begins with "coset representatives" L(x) which are group elements g G G such that 
every group element can be decomposed as g = L(x)h with h G H. We denote the coset 
generators by K m and the subgroup generators by Hi. Then L~ 1 (x)d tl L(x) = e™(x)K m + 
oo l ll (x)Hi. We shall take the generators K m and if; in the fundamental representation 
of SU(N): antihermitian N x N matrices. Under a general coordinate transformation 
from x M to y^(x), the vielbein transforms as a covariant vector with index ji but also 
as a contravariant vector with index m at x = 0. Hence V does (only) depend on the 
choice of the coordinates at the origin. At the origin, L~ 1 d lJL L = e™(0)K m , and we fix the 
normalization of K m by tr — — | for K m in the N x N matrix representation of SU(N). 

To find the volume of SU(N)/SU(N — 1) we note that the group elements of SU(N) 
have a natural action on the space and map a point (z 1 , . . . , z N ) G on the com- 



plex hypersphere Yl 



N 



1 into another point on the complex hypersphere. The 



"south-pole" (0, . . . , 0, 1) is kept invariant by the subgroup SU(N — 1), and points on the 
complex hypersphere are in one-to-one correspondence with the coset representatives L(z) 
of SU(N)/SU(N — 1). We use as generators for SU(N) the generators for SU(N — 1) in 
the upper-left block, and further the following coset generators: N — 1 pairs and T2k+i 
each of them containing only two non-zero elements 

/o ... o \ /o ... \ 



i/2 



\ i/2 ... J 
and further one diagonal generator 



1/2 



\0 -1/2 



(C.4) 



/ 



N 2 -l 



2U N(N -1) 



diag(-l,...,-l,7V-l) 



(C.5) 



(For instance, for SU(3) there are two pairs, proportional to the usual A4 and A5 and A6 
and A7, and the diagonal hypercharge generator A§.) The idea now is to establish a natural 
one-to-one correspondence between points in C N and points in R 2Ar , namely we write all 
points (x 1 , . . . ,x 2N ) in R 2iV as points in C N as follows: {ix l + x 2 



..,ix 2JV - 1 + x 2N ). In 
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particular the south pole (0, 0, 0, 1) in H 2N corresponds to the south pole (0, 0, 0, 1) 
in C N and the sphere Y^,=i{ x% ) 2 — 1 m ^ 2N corresponds to the hypersphere YliLi \ z% \ 2 = 1 
in . Points on the sphere S 2N ~ l in H 2N correspond one-to-one to coset elements of 
SO (2N)/SO{2N - 1). The coset generators of SO{2N)/SO{2N - 1) are antisymmetric 
2N x 2N matrices A t (I = I,..., 2N — 1) with the entry +1/2 in the last column and 
— 1/2 in the last row. The coset element 1 + 5g — 1 + dt T Aj maps the south pole s = 
(0, . . . , 0, 1) in R 2N to a point s + 5s in R 2N where 5s = l/2(dt\ . . . , dt 2N ~\ 0). We know 
how points in correspond to points in H 2N , so we can ask which coset element in 
SU(N)/SU(N — 1) maps the south-pole in C N to the point in which corresponds to 
s + 5s. In the corresponding point is s + 5s with 5s = 1 /2(idt 1 + dt 2 , . . . ,idt 2N ~ r ). 
The coset generators of SU(N)/SU(N — 1) act in C N as follows: g — 1 + dx^K^ maps the 

south-pole s to s + 5s where now 5s = 1/2 (idx 1 + dx 2 , . . . , i\J 2 ^ N ^^ -dx 2N ~ l ). We can cover 
SO(2N)/SO(2N - 1) = S 2 ^ 1 with small patches. Similarly we cover SU(N)/SU(N - 1) 
with small patches. Each patch of S 21 ^' 1 can be brought by the action of a suitable 
coset element to the south-pole, and then we can use the inverse of this group element 
to map this patch back into the manifold SU(N)/ SU(N — 1). In this way both S 2N_1 
and SU (JV) / SU (N — 1) are covered by patches which are in a one-to-one correspondence. 
Each pair of patches has the same ratio of volumes since both patches can be brought 
to the south pole by the same group element and at the south pole the ratio of their 
volumes is the same. To find the ratio of the volumes of S 2N_1 and SU(N)/ SU(N — 1), 
it is then sufficient to consider a small patch near the south pole. Near the south pole 
the vielbeins become unit matrices for coset manifolds, hence the volume of the patches 
near the south-pole is simply the product of the coordinates of these patches. Consider 
then a small patch at the south pole of S 2N_1 with coordinates (dt 1 ,..., dt 2Ar_1 ) and 
volume dt 1 . . . di 2JV_1 . The same patch at the south pole in has coordinates dx M where 

(idt 1 + dt 2 , . . . , idt 2JV_1 ) = ^idx 1 + dx 2 , . . . , i^J ^^^ -dx 21 ^^ 1 ^ . The volume of a patch in 
SU(N)/SU(N - 1) with coordinates dx 1 , . . . , dx 2 ^ 1 is dx 1 . . . dx 2JV-1 . It follows that the 



volume of SU(N)/SU(N - 1) equals the volume of S 2 ^ 1 times 



N 58 



2(7V-1) 



Vol SU{N) _ N 

Vo1 SU{N - 1) " V 2(A^1) Vo1 S ■ (C ' 6) 
From here the evaluation of Vol SU(N)/H is straightforward. Using 



58 This result yields the same answer for (6.15) as [52], but it yields ir N /(N N\) for the volume of the 
complex projective space CP(N) = SU(N + 1)/(SU(N) x 17(1) ) which differs from the result Vol[U(N + 
l)/(U(N) x {7(1))] = VolS 2 ^ given in [90]. 
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where / = 1 if one uses the normalization tr = —2, but I = 2 2N 1 with our normalization 
of tr = — \, we obtain 



iv /ok 

Vol SU(N) = VnT[ ^ 2 2fc - 1 . (C.8) 

We assumed that VolS77(l) = 1 which seems a natural value but must be, and will be, 
justified below. Then 

Vol H = Vol SU(N - 2) Vol "[/(!)" , Vol "[/(!)" = 4vrW ^— ^ , 



Vol ^(AO/tf = I^T3T)!(iV^)! 22iV ~ l22JV " 3 • ( C - 9 ) 

This then produces formula (6.15). 

As an application and check of this analysis let us derive a few relations between the 
volumes of different groups. From now on till the end of this appendix we adopt the 
normalization tr(T a T b ) = —2S ab for the generators of all groups involved. Let us check that 
Vol SU{2) = 2Vol SO{3), Vol SU{4) = 2Vol SO(6) and Vol SO(A) = \ (Vol SU(2)f (the 
latter will follow from SO(A) = SU(2) x SU(2)/Z 2 ). We begin with the usual formula for 
the surface of a sphere with unit radius (given already above for odd N) 

27T (N+l)/2 

Vol S N = . (CIO) 



2N-2 



In particular Vol S 1 = 2tc and 



Vol S 2 = An , Vol S 3 = 2tt 2 , Vol S 4 = fvr 2 , 

VoIS^tt 3 , Vol5 6 = if7r 3 , Vo15 7 =|tt 4 . (C.ll) 

Furthermore Vol SO(2) = 2n since the SO{2) generator with trT 2 = -2 is T = (°^) and 
exp(^T) is an ordinary rotation ( f ° s -^ ^ ) for which < 9 < 2n. The vielbein is unity 
for an abelian group, and thus the Haar measure is 59 simply d#. 

With VolSCKAO = Vol 1 S JV - 1 Vol50(A^ - 1) we obtain Vol SO(l) = 1 and 

Vol SO (2) = 2tt , Vol SO (3) = 8tt 2 , Vol SO (A) = 16tt 4 , 

Vol SO(5) = ^ Tr 6 , Vol SO(Q) = ^ vr 9 . (C.12) 

Now consider SU (2). In the normalization Ti = — ir 1: T 2 = —ir 2 and T 3 = — ir 3 (so that 
tr T a T b = -25 ab ) we find by direct evaluation 60 using Euler angles Vol SU{2) = 2tt 2 . This 



59 0ne clearly must specify the normalization of the generators T a ; for example by choosing T a = 
o - \ 

i 2 ^ J , the range of 6 becomes < 9 < 4ir, but the Haar measure is still d9. 
60 Parametrize g = e aT3 e^ Tl e lT3 , determine the range of a,/3, 7 and compute the group viclbcins. 
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also agrees with (C.6) and (C.8) for N = 2, justifying our assumption that Vol SU(1) = I. 
For higher N we get 

Vol SU{2) = 2tt 2 , Vol SU{3) = V3n 5 , Vol 5C/(4) = ^vr 9 . (C.13) 

The group elements of SU{2) can also be written as g = x A + if • x with (x 4 ) 2 + 
(x) 2 = 1 which defines a sphere 5 3 . Since near the unit element g ~ 1 + if ■ 5x, the 
normalization of the generators is as before, and hence for this parametrization Vol SU (2) = 
2-7T 2 . This is indeed equal to Vol 5 3 . In the mathematical literature one finds the statement 
that Vol SU (2) is twice Vol 50(3) because 577(2) is the double covering group of 5*0 (3). 
However, we have just found that Vol SU{2) = |Vol 50(3). The reason is that in order 
to compare properties of different groups we should normalize the generators such that 
the structure constants are the same (the Lie algebras are the same, although the group 
volumes are not). In other words, we should use the normalization that the adjoint 
representations have the same tr T a Tb. For SU(2) the generators which lead to the same 
commutators as the usual 50(3) rotation generators (with entries +1 and —1) are T a = 
{ — |ti, — |r 2 , — |r 3 }. Then tr T a T b = — ^5 a b. In this normalization, the range of each 
group coordinate is multiplied by 2, leading to Vol SU(2) = 2 3 • 2n 2 = I6ir 2 . Now indeed 
Vol SU{2) = 2 Vol 50(3). 

For SU(A) the generators with the same Lie algebra as 50(6) are the 15 antihermitean 
4x4 matrices |(7m7n -7n7m), ^7m/2, 7m75/2 and 275/2, where 7 m and 75 are the five 
4x4 matrices 7m obeying the Clifford algebra {^MiIn} = 2$mn 61 • Now, tr T a T b = 
—5 a b (for example, tr{(^7i7 2 ) Q7172) } = —1). Recall that originally we had chosen the 
normalization tr T a Tb = —25 a b- We must thus multiply the range of each coordinate by 
a factor y/2, and hence we must multiply our original result for Vol SU(A) by a factor 
(v^) 15 - We find then indeed that the relation Vol SU(A) = 2 Vol 50(6) is fulfilled. 

Finally, we consider the relation 50(4) = SU (2) x SU (2) /Z 2 . (The vector representation 
of 50(4) corresponds to the representation (|, |) of 577(2) x SU(2), but representations 
like (|,0) and (0, |) are not representations of 50(4) and hence we must divide by Z 2 . 
The reasoning is the same as for SU(2) and 50(3), or SU(A) and 50(6).) We choose the 
generators of 50(4) as follows 

T[ +) = -j= (L 14 + L 23 ) , T 2 (+) = i= (Lsi + L 24 ) , T 3 (+) = ( Ll2 + L 34 ) , (C.14) 

and the same but with minus sign denoted by Here L mn equals +1 in the m th column 
and n th row, and is antisymmetric. Clearly tr T a T b = —25 a b- The structure constants follow 
from 

-j= (L12 + L34) , ^= (L u + L 23 ) , = - (L 3 i + L 24 ) , (C.15) 



61 



As Dirac matrices in six dimensions we take j m <£> T2 , 75 ® T2 and I x T3. 
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thus 



T (+) T (+) 

i ' 3 



<+) 



-V2e ljk Tl +} , Tt\Tt ] =-V2e llk T l 



j 



z ijk 1 k 



<-) 



T (+) T (-) 
i ' j 



. (C.16) 



We choose for the generators of SU (2) x SU(2) the representation 



^ = ^i- T ^ ) = l0 7l- (ai7) 

Then we get the same commutation relations as for 5*0(4) generators (C.16); however, the 
generators are normalized differently, namely tr T a T b = —25 a b for SO (4) but tr T a T b = —5^ 
for SU(2). With the normalization tr T a T b = -25 ab we found Vol SU(2) = 2tt 2 . In 
the present normalization we find Vol SU(2) = 2ir 2 (V2) . The relation Vol 5*0(4) = 
| (Vol SU{2)) 2 is now indeed satisfied 

Vol50(4) = 167r 4 = i(Vol5f/(2)) 2 = ^27r 2 (^v / 2) 3 ^ . (C.18) 
D Zero modes and conformal symmetries 

The bosonic collective coordinates obtained for gauge group SU (2) and the one-instanton 
solution could all be identified with rigid symmetries of the action: a M with translations, 
p with scale transformations and 6 a with rigid gauge symmetries. Similarly, the fermionic 
collective coordinates for SU(2)(^ a and fja with a, a — 1,2) could be identified with ordi- 
nary supersymmetry and conformal supersymmetry. However, the full conformal algebra 
in 4 Euclidean dimensions is 50(5, 1), and its generators are P M , K^, D, M^, so one might 
expect that the conformal boost transformations and the Lorentz rotations M^ u pro- 
duce further collective coordinates. As we now show, the transformations due to these 
symmetries can be undone by suitably chosen gauge transformations with constant gauge 
parameters [29]. So there are no further bosonic collective coordinates, as we already know 
from the index theorem discussed in the main text. 

Consider first rigid Lorentz transformations. Here one should not forget that in addition 
to a spin part which acts on the indices of a field they also contain an orbital part that 
acts on the coordinates: = £^ + L^ u . For example, for a spinor one has 5(\ mn )ip = 
^^mnlmn^ + {^mnXmd n )ip . One may check that only with this orbital part present the Dirac 
action is Lorentz invariant. In fact, starting with only the spin part or the orbital part, 
one can find the other part by requiring invariance of the action. We begin by considering 
the field strength = 2a liv p 2 / (x 2 + p 2 ) 2 for an instanton with k — 1 in the regular 
gauge. Under a Lorentz transformation with parameter A MJ , = — X u ^ one has SmA^ = 
XiivA v + X mn x m d n A u . (Note that coordinates transform opposite to fields: 5x m = —X mn x n . 
One may check this transformation rule by showing that the Maxwell action is Lorentz 
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invariant (use the Bianchi identities 62 ), or just by writing down the transformation law for 
a covariant vector in general relativity. The Lagrangian transforms into d p (^C), where 
£M = \P^x p .). The field strength of the instanton transforms as follows 

8mF pu = \p P F pu + \ up Fpp (D.l) 

There is no contribution from the orbital part because x 2 is Lorentz invariant. On the 
other hand, under a gauge transformation with parameter A pa we obtain 63 

<W F /^ = [(Tpu, \Ap a a p J(2p 2 /(x 2 + p 2 ) 2 ) = A ua Fp a - A pa F V(7 (D.2) 

Thus Fp V is invariant under combined Lorentz and gauge transformations with opposite 
parameters, A pcT = — A pcr . Using a pa = irj apa T a , it is clear that the SU{2) gauge parameter 
A a is proportional to r] a p a \p a . Only the selfdual part of A pcr contributes. For an anti- 
instanton we would have needed the anti-self dual part of A pcr . So we have only proven that 
Fp, v is invariant under combined Lorentz and gauge transformation if the Lorentz parameter 
is self dual. However, the anti-self dual part of \ pa leaves F pu separately invariant, without 
the need to add compensating gauge transformations. One can prove this directly, using 
that \p,pF pv = —{*Xpp){*F pu ) and then working out the product of two e-tensors and finally 
antisymmetrizing in pu, but it is already clear from the index structure: F pv is proportional 
to {<Jpu)a'^' while an anti-selfdual \ pa has in spinor notation only undotted indices. 

Let us now repeat this exercise for the gauge field Ap. One finds for the combined 
Lorentz and gauge transformation 

$Ap = \p V A v + \ pa x p d a Ap + [Ap, ^A pa a pa ] (D.3) 

The instanton field Ap for k = 1 in the regular gauge is given by Ap = {—Up U x v )/{x 2 + 
p 2 ). The orbital part with \ pa now contributes, but there is no term dpA a in the gauge 
transformation of Ap since A a is constant. One obtains 



5Ap = \ V A V + J - 2 2* (D-4) 



x 2 + p 2 x 2 + p 2 

For Ap U = —\p U all terms again cancel. Hence, Lorentz symmetry does not yield further 
zero modes. 

In spinor notation these results are almost obvious. In general the selfdual part of a 
curvature reads in spinor notation 

{Fp V )\{op V )J' (D-5) 



62 0ne has 5 M \F* V = F pi/ d p (X up A p ) + F pi/ dp{\ mn x m d n A u ) = F pv \ mn x m d p d n A v . Replacing d p d n A v 
by -d n d v Ap - d v d p A n yields d p {^C). 

63 The usual form of an SU (2) gauge transformation is 5F pv — [F pv , A a (x) ^-], but using riapv'Hbpv = 45 a f, 
and a pa = irj apa T a , this can be rewritten as 5F pv = [F pv , jA pa a pa ] where A pcr = - \r\ apa k a . 
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where u, v are the indices of (r a ) u v , and a', (3' are the spinor indices. If we raise/lower 
indices by e tensors, we get for the instanton solution 

(i^n^W = , ~ ^ + (D.6) 

It is then clear that F^ is invariant under diagonal transformations of SU (2)r and SU (2) gauge , 
and separately invariant under SU(2) L . For an anti-instanton, the roles of SU(2) L and 
SU(2) R are interchanged. 

We come now to the more complicated problem of conformal transformations. A con- 
formal transformation of a field <p with constant parameter a m is given by 64 

5(a m K m )cp = (2a-xx m - a m x 2 )d m ip + 5(2a ■ xD^)<p 

+ 5(2a m x n M^)<p (D.7) 

where D s P in and M^f™ act only on <p(0) and 8(a m K m )(p(x) is by definition a m K m ]. As 

the notation indicates, only the spin parts of the dilatational generator D and the Lorentz 
generators contribute. For example 

5(D s nA, = [A„ D^} = A, , 8{\\ mn M^)A, = \ V A V . (D.8) 

Consider first F^ v . We obtain 

S(a m K m )F^ = (2a ■ xx m - a m x 2 )d m F^ + 4a • xF^ 

+45 {\a m x n M^) with F, v = . (D.9) 

[x + p ) 

We already know that the last term can be canceled by a suitable gauge transformation 
(there are no contributions from Mmn^ because x 2 is Lorentz invariant). The first term 

2 a 2 

gives -4 " 2 ° 2 F F . The first and second term together produce then J^ i 7 )^. But this is 
the opposite of a translation with parameter a m p 2 , namely 



5(a m p 2 P m )F, u = \ a ' X f F, u ; 5(P m ) V = d m ip . (D.10) 
x z + p z 



64 This formula follows from 8(a m K m )tp(x) = [tp(x),a m K m ], and tp(x) = e - p - x (p(0)e p - x with [<p{0),Pp] = 
c^(0). One may then use e Px K m = {e p ' x K m e~ p - X )e p - X and [K m ,P n ] = -2S mn D - 2M mn ; [P m ,D] = 
Pm',[P m ,M rs ] = S mr P s — S ms P r and this yields (D.7). In the same way one may derive the Lorentz 
transformation rule for a spinor ip(x), with both spin and orbital parts, by using that the spin part is 
given by [ip{0), lX mn M mn ] = jA TO „7 m „V(0). One finds then the correct result: 5(^X mn M mn )ip(x) = 
jA m „7 mn V(x) + \ mn x m d n ip(x) . Given the spin part of the transformation rule of the field at the origin, 
one derives in this way the orbital part. In this way one finds that the generators of the conformal 
algebra act as follows on the coordinates: S(P m )x n = S m n 7 S(D)x n = x n ,S(M st )x m — x s 5t m — x t 5 s m and 
5(K. m )x n = 2x m x n — x 2 S m n . Note that coordinates transform contragradiently to fields. For example, 
whereas [6(K m ),5(P n )]<p = -5([K m ,P n ])<p (by definition), one finds [8(K m ),5(P n )]x s = 5([K m , P n ])x s . 
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Thus the following combination of symmetry transformations leaves F pv invariant 

O K m + p CL P m + <5gaugc(A mft = 2a m X n + 2x m CL n ) (D.ll) 

Let us now check that also A p itself is invariant under this combination of symmetries. 
We find by direct evaluation, using A p = (—a py x u )/(x 2 + p 2 ) and (D.9) and (D.2) 

SA f = 2 , 2 A M - <V" Y~2 + 2(1 ' M 

\ ar + p l x z + p z J 

+ d^-dpXa&pa) + L4 M , -a p x CT o- p(J ] . (D.12) 

As in the case of the sum of the first, third and sixth term cancels. This takes care of 
the dilatation term and the denominator of A^. We are left with terms from the numerator, 
and Lorentz and gauge terms 

(2a • xA p + (a pp a p ) ( * - J + (0 - 2x il a v A v ) + 
\x + p J 

-°^a v p 2 \ 2x u ( a v x a o pa - a p x v a pp \ 

-^) +M + ^T?{ + + a f ^ e J (D ' 13) 

The terms denoted by "0" vanish due to x v A v = 0. All other terms cancel in the following 
combinations 

(i) the second, fourth, fifth, and seventh nonvanishing contributions sum up to zero. These 
are the terms with o pp a p . 

(ii) the first and third-but-last nonvanishing term cancel each other. Here conformal boosts 
cancel a gauge term. 

(iii) the remaining Lorentz term —2x p a v A v cancels the remaining gauge term 2x v [x 2 + 
p 2 Y l (a p x p a up ). 

Hence, conformal boosts do not lead to further zero modes either. 
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